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PREFACE 


The physical sciences have made such advances in’ the last 
twenty or thirty years that it would be difficult to name a single 
branch of human knowledge that has not been affected significant- 
ly, or even radically, as a result. Modern engineering is to a large 
extent the creation of physics. New ideas and facts discovered by 
physicists become an integral part of man’s knowledge of the sur- 
rounding world. The problem of the structure of matter, whose 
investigation previously appeared as a peculiar form of mental 
exercise, has come to the forefront in science as a most important 
task that is inseparably linked with the development of civilisation. 

A broad knowledge of physics is a necessity for the specialist 
working in any branch of science or engineering if he desires to 
comprehend the fundamentals of his field of knowledge and is striy- 
ing to take a creative part in its development. The task of a course 
in physics for students of a technical institute consists, therefore, in 
helping them to understand the physical basis of engineering. 

In addition to this main task, a course in physics in a technical 
institute should be organised in such a manner as to help the student 
to master experimental technique and acquaint him with equipment 
used to measure physical quantities. Skill in experimental physics 
is attained by working in the laboratory. It seems to us that famil- 
iarising oneself with experimental physics is a completely distinct 
task in the study of physics in technical institutes. The interweaving 
of experimental physics with the study of general physical laws 
and phenomena is only occasionally pedagogically justified. This 
is due to the fact that modern experimental physics cannot be 
sharply subdivided. The measurement of coefficients of expansion is 
accomplished with the aid of interferometry, radio equipment is 
required for experiments in mechanics and heat, and the investiga- 
tion of the structure of metals is inseparably linked with experiments 
in electricity. Physical experiments conducted with the aid of 
outmoded techniques are of interest only to specialists in physics 
desiring to trace the development of one or another experiment. It 
would probably be most expedient to arrange the curriculum in such 
a manner that laboratory work followed a course in general 
physics. 
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Thus, the author believes that lectures in physics, and conse- 
quently the corresponding textbook, should include only outlines 
of experiments, i.e., the goal of the experiments. 

Once agreed on the necessity for excluding experimental physics 
from our course, we must then choose between the inductive approach 
(from particular experimental facts to theoretical generalities) 
and the deductive approach (from theory to its experimental corro- 
boration and manifold applications). In a very extensive course, it 
is probably possible to combine these two approaches as they are 
linked in the development of science. This possibility was not open 
to the author and so the second approach was chosen. Presentation 
of the basic theoretical propositions, the deduction of corollaries 
that could be verified experimentally, and then the illustration of 
these experiments by means of diagrams—this was the approach 
adopted in practically every chapter of this book. Naturally, this 
meant that the historical method had to be completely disregarded. 
The history of the origin of ideas, the formulation and discard of 
physical theories, remained beyond the scope of this book, since it is 
written for the student who is not aiming to be a professional physi- 
cist. It seems to me that only such a method of presentation makes 
for clarity and conciseness. 

This approach is also dictated by the structure of the course. If 
the fundamental laws of physics are regarded as of paramount 
importance. and we carefully consider the body of facts pertaining 
to a particular complex of theoretical ideas, then it would be expedi- 
ent to separate the presentation of the strictly phenomenological 
theories and their corollaries from the problems relating to the 
structure of matter. Classical mechanics, thermodynamics, statisti- 
cal physics and electrodynamics are self-contained and to a large 
extent in themselves complete fields of physics. Their presentation 
should precede the consideration of the problems relating to the 
soucie of matter. It is expedient to consider the connection be- 
ween physical properties and the structure of matter after the funda- 
mentals of classical physics have been presented and after the 
theories of the structure of matter have been discussed. 

In many books on physics, problems related to the structure of 
matter and problems connecting the properties of matter with its 
structure are scattered through various parts of the course and to 
a large extent are lost, i.e., dispersed in other topics. It seems to 


me that this is not justified both from the point of view of the logi- 


cal structure of the course as well as from the viewpoint of the 
importance of these problems, especially if it is borne in mind that 
many institutions of higher education teach chemistry and technolo- 
gy, and specialise in matter and materials. The attention of the 
reader is called to the fact that problems related to the structure of 
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matter are discussed in a separate, concluding section. This is clearly 
the basic difference, as regards presentation, between this book 
and others. 

Another distinguishing feature of this textbook on physics is 
the integrated presentation of problems in radio physics and optics. 
This helps to give the reader a better understanding of the essence 
of physical phenomena. 

In view of the relatively short time allotted to a course in physics, 
one is forced to seek methods of presentation that are as condensed 
and concise as possible. At the same time, it is impermissible to treat 
superficially a number of problems in physics, the knowledge of 
which is a sine qua non today for every qualified chemist, technolo- 
gist and engineer. The reconciliation of these two contradictory 
requirements placed the author in a difficult situation. He decided, 
finally, to completely exclude repetition of all secondary-school 
material. However, this measure alone was insufficient to make 
room in this single volume for all the important problems of physics. 
To further reduce the size of the book, a concise presentation of the 
material was necessary. Some people feel that lectures and textbooks 
should include selected problems which are exhaustively discussed, 
and all that cannot be presented historically and logically, in 
complete detail should be excluded from the physics course. Such 
a viewpoint, it seems to me, is basically incorrect. A detailed knowl- 
edge of certain topics is not essential for the future engineer. At 
the same time, however, he must have a general notion of these 
topics, if only to be aware of the existence of these phenomena and 
laws. Therefore, on various occasions only the theoretical con- 
clusions have been presented and our knowledge in a number of 
branches of science is merely summarised, i.e., a detailed descrip- 
tion of the manner in which this knowledge was obtained has not 
been giyen despite its interesting and educational nature. 

At first glance, it may appear that the author, overlooking the 
fact that mathematics and physics generally are taught concurrently, 
has not employed mathematics wisely. Thus, the definite integral 
Symbol appears in our book before the student has studied the 
integral calculus. However, an understanding of the text does not 
require that the student be able to integrate, but that he merely be 
familiar with the concept of the definite integral. It would be inadvis- 
able and cumbersome to avoid such formulas in a physics textbook, 
and it would result in an impoverished presentation of the physical 
concepts involved. A more profitable approach is to introduce some 
minor changes in the curriculum with respect to mathematics, so 
that the students are initiated into the concepts of mathematical 
analysis early in the course. This suffices for the student to be able 
to read books using these symbols. The process of integration may be 
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studied considerably after the relatively easy concept of the definite 
integral has been assimilated. 

The interconnection between the various branches of physics has 
its effect on the presentation of theory and experimental results. 
If the lecturer desires to maintain logical presentation in the course, 
he must inevitably resort to repetition and cross-reference. There are 
innumerable examples of this. Thus, on the one hand, it is not 
desirable to separate the presentation of X-ray diffraction from 
optical diffraction. On the other hand, it is impermissible to disasso- 
ciate the latter problem from the problems of crystal structure. Or, 
as another example, it is difficult to treat problems of permittivity 
and dipole moments in separate sections, but at the same time it is 
not possible to discuss dipole moments without discussing other 
problems related to the structure of molecules. Furthermore, permit- 
tivity and coefficients of refraction must be discussed together. 
There is, of course, only one solution: in lectures, repetition is essen- 
tial (very useful, incidentally, for students); and in the textbook, 
recourse must be had to cross-reference. Owing to cross-referencing, 
a physics textbook cannot be read a single time in consecutive order. 
This also means that in order to properly understand one portion of 


the physics field, familiarity with the field as a whole is generally 
necessary. è 


A. Da ee 


PART ONE 
MECHANICAL AND THERMAL}MOTION 


CHAPTER I 


THE FUNDAMENTAL LAW OF MECHANICS 


1. Kinematics 


Equations of Motion of a Particle. If the dimensions and shape 
of a body are of no consequence in the consideration of a particular 
phenomenon, we can conceive of the body as being represented by 
a point. This approximate representation of a body by a material 


Fig. 4 


(i.e., mass) point is not only justified when the dimensions of the 
body are small relative to other distances considered in the problem, 
but is permissible whenever we are only interested in the motion of 
the centre of mass of the body. 

In order to describe the motion of a particle, one must indicate 
through which points in space the particle has passed and the © 
instants of time during which it was located at one or another 
point of the path. For this purpose, it is necessary, in the first place, 
to select a coordinate frame of reference (Fig. 1). The location of 

2* 
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a point in such a coordinate system, which in its simplest form 
is right-angled, is determined by the three coordinates x, y, 2, or 
by the so-called radius vector 7, drawn from the origin of the coordi- 
nate system to the given point* (Fig. 2). 

Thus, motion in space can be roughly described in the form of 
a table of values for v (each value being given by three quantities!) 
for the instants of time tı, tz, etc.; or accurately described in the 


Fig. 2 Fig. 3 


form of a continuous function 2 = f(t) Tin essence, three functions, 
eg., «= filt), y = fot); z= falt); or r=qi(t), a = p(t), B = 
= z(t); ete.]. 

The vector equation 1 = f(t) or, what amounts to the same, the 
three equivalent scalar equations are called the equations of motion. 

Average Velocity. Let us consider AB, a portion of the path. 
Assume that at the instant of time ¢ the moving particle was at A, 
and at the instant of time ¢ + At at B (Fig. 3). Let us introduce the 
radius vectors r4 and 7g. We know that during the interval of time 
At, the particle moved from A to B. It is therefore natural to call 


Se 
the vector AB the particle displacement vector. 


Vectors may be added by the parallelogram method. From 
Fig. 8, we see that 


=> > 
Tg=ra + AB or AB =r =r = Ar, 


* The radius vector is given by its magnitude, » = Vat + y? + 22, and 
the angles it forms with the coordinate axes: cosa =: , cos p= and 
F 


cos y= t Thus, it is determined by three quantities: z, y and z; or r, œ and 


P; or r, « and y; etc. (two angles determine the third, since cos? + cos?$ + 
+ cos? = 1). 
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i.e., the particle displacement vector is the vector difference of 
the radius vectors. The curvilinear motion is determined by the 
displacement vector Ax for time At, whereby the smaller Av, the 
greater the accuracy. 

The average speed for the path AB is given by the relation 
AB 
At ~ 
This is the speed at which the body would have traversed the dis- 
tance AB in uniform and rectilinear motion during the interval of 


time At. 
Thus, motion over the path AB may be specified by giving 


Vav = 


te 
the direction of the'vector AB = Ar and the speed Vav. In place 
of this, we introduce the vector 


> 
ees AB Ar 
Voor At? 


which is equal in magnitude to the average speed and whose direc- 
tion is that of the displacement vector. We can now say that the 
motion of the body over the path AB is determined by the average 
velocity. ? 

Instantaneous Velocity. If we decrease the interval of time At, 
the point B will approach point A. These points finally merge and 


F 
the direction of AB then coincides with the tangent to the curve at 


the point of merger. 
> 


2 AAG: aaah = 
As At decreases, the ratio >> approaches a limit. The vector 


Vinst, having the direction of the tangent to the curve at the 
given moment of motion and numerically equal to the limit of 


the ratio a as At > 0, is called the instantaneous particle velocity: 


vms = limit So when At — 0. 


In other words, the instantaneous velocity is the derivative of the 
vector # with respect to time: 
dr 
ars 

It should again be emphasised that it is not absolutely essential 
to employ vectors in order to describe motion. Instead of using 
the concept of vector velocity, we could speak of the absolute 


value of the velocity, lazi ,* and indicate the direction of motio. 
a 


indicate that only the absolute value (mo 


* The vertical bars || 1 the 
bars is being considered. 


of the vector between the 
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If we did this, however, the same rules and the same experimental 
facts would require more cumbersome and more wordy formula- 
tions. Vector notation corresponds to physical experience, and is 
moreover concise and expressive. A certain amount of effort, how- 
ever, is required to become accustomed to it. 

Since the projections of the vector a on the coordinate axes are 


the coordinates of its terminus, z, y and z, the projections of the 
velocity vector are: 


dz d; dz 


= sakes di. l 
Ke an a ea a oea 


Acceleration. To continue our consideration of curvilinear motion, 
let us draw arrows to represent the instantaneous velocities of the 
body in passing through the points A and B of its path. If we had 
not introduced the concept of velocity, we would have to describe 
the situation as follows: the speed at B is different from that at A; 
moreover, the direction of motion has changed. Using the concept 
of velocity, we can state more briefly: the velocity at B is differ- 
ent from that at A. 

Velocity can change in magnitude and direction. 

If the path AB is rectilinear, the vectors va and vg have the 
same direction. The change in velocity is obtained by arithmeti- 
cally subtracting the magnitude of the vector va from the magni- 
tude of the vector vp. i 

Let us now consider the curvilinear path AB; vectors va and vp 


differ in magnitude as well as 
in direction. To determine 
the increase in the magnitude 
of the velocity, it is neces- 
sary, as before, to subtract the 
magnitude of the vector ta 
from the magnitude of the 
vector Vg: 


Aļv|=|vs|— 


However, this quantity does 
not, of course, completely 
express the change that has 
Fig 4 occurred in the motion. 
Let us now subtract vector 
va from vector vp in accord- 
operating on vectors. Fig. 4 shows vector 
NEE an 


ance with the laws for 


aes See ee ee 
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Vector vp, the sum of Av-+va, is the diagonal of the parallelogram 
constructed on these vectors. 

Vector Av is called the velocity increment. The magnitude of 
this vector in the case of curvilinear motion is not A]v| = |vz| — 
—|v,|. From the figure, it is evident that the magnitude of the 
increment vector | Av | is greater than Aj v |, the difference in the magni- 
tudes of the velocities. To determine the velocity at point B, one 
must add velocity va and increment Av by the parallelogram method. 

We can now determine the acceleration for curvilinear motion 
as follows. The ratio of the velocity increment to the interval of 
time during which this increment takes place is called average 
acceleration: 


Av 
Gav = TE * 


When the interval of time At is decreased, this ratio approaches 
a limit. The vector 
ace ADE 
Ginst = limit 7p W hen At — 0 


is called the instantaneous acceleration of a body at a given moment 
of motion. In other words, acceleration is the derivative of velocity: 


and 
digg a 


The acceleration vector uniquely determines the nature of the 
change in the velocity of the body. 

Generally speaking, the acceleration. vector can form any angle 
with the curve. This angle determines the nature of the acceleration 
and the curvature of the path as follows. Through the point of the 
curve that is being considered, a circle is drawn that has a common 
tangent with the path of motion at this point, and for the given 
portion of the curve most accurately approximates it. This circle 
is called a tangential circle * and its radius p is called the radius 
of curvature at the given point. The acceleration vector is always 
directed into this circle. If the motion is accelerated, the vector @ 
forms an acute angle with 


the curve (i-e., with the tangent to the 
path at the given point). If the motion is retarded, this angle will 


evo is = eens 


* The tangential circle and the calculation of radius of curvature is 
studied in detail in courses on differential geometry. 


“ows. 
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be obtuse. Finally, if the magnitude of the velocity does not change, 
the acceleration vector is directed normal to the curve. 

These statements can be proved rigorously, but we shall merely 
illustrate them geometrically here (sce Fig. 5). 

In line with the above discussion, it is customary to resolve the 
acceleration vector into two components (Fig. 6): 


a= Az:+ An. 


Since the vector triangle is right-angled, 


a=Vaj+a. 


The vector a,, directed along the curve, represents the change in 
the magnitude of the velocity and is called the tangential acceler- 


4 


[say 


ation. It is not difficult to 
show that the tangential 
acceleration 


n Aa 
n = limit ———. 
a= limi AI when 


A d 
At— 0, i.e., a= ll 


where A|v| is the increment 
in the magnitude of the ve- 
locity. 

The vector @,, directed 
normal to the curve, repre- 
sents the change in the direc- 
tion of the velocity and is 
called the normal accelera- 
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tion. The normal acceleration a, is related by a simple for- 
mula to the speed v and the radius of curvature p at the given 
point, namely: 
v? 
ün =—. 
n o 


From this formula, which is derived in courses in theoretical mechan- 
ics on the basis of geometrical considerations, it follows that motion 
with a constant normal acceleration (a, and v constant quantities) 
is circular motion. In this case, p is a constant quantity for all 
points along the path and is equal to the radius of the circle. 


2 


The normal acceleration @, => is often also called centripetal 
acceleration. 

Centripetal acceleration of a body moving in a circle with radius 
R can also be expressed by means of the period 7’, the frequency v, 
or the angular velocity of this motion. Between these quantities 
and the linear velocity v, the following simple relations exist: 


2aR 1 
v=» v=o, ve and @= 


The last two formulas define the auxiliary quantities v and o. 
Thus, the centripetal acceleration for the motion of a body in 


a circle can also be written in the form: 


2 
an=0?R or an= mR, 
T2 
It should be emphasised that the everyday understanding of the 
word “acceleration” is much more limited than in physics. The con- 
cept of acceleration in physics includes retardation (negative accel- 
eration) and, what is most important, includes uniform motion if 
this motion is along a curved path. Only motion that is simulta- 
neously rectilinear and uniform is considered motion without 


acceleration. 

A proton in a modern accelerator moves in @ 
tion of the order of 101° m/sec?. The linear accel- 
~30 m/sec. The acceleration of a hockey hall 
is ~10 m/sec?. The initial acceleration of an automobile is 1-2 m/sec?. The 
angular velocity of the rotor of a turbogenerator is 314 rads/sec and, at a dis- 
tance of 0.5 metre from the axis of rotation, articles move with an accelera- 
tion of ~5 X 104 m/sec. The angular velocity of a bicycle wheel is 7- 
10 rads/sec and, at a radius of 0.5 metre, particles on the rim have a normal 
acceleration of about 20 m/sec?. 


2. Force 


Examples of acceleration. 
circle with a normal accelera 
eration of a modern rocket is 


interaction are known to the 


At the present time, three types of 
1 and electromagnetic forces 


physicist. The laws of gravitationa 
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have been thoroughly investigated, and nuclear forces are being 
intensely studied. 

Gravitational Force. The force of attraction between heavenly 
bodies, which was discovered by Newton and is otherwise known 
as gravitational force, acts between any two particles in accordance 
with the law 


mmg 
PEN, 
where y=5xX 10° dyne x cm?/gm?, m, and ms are the masses of 
the particles, and r is the distance between them. 

It can be rigorously proved, but we shall not do so here, that 
Newton’s law of gravitation written in the form valid for bodies 
having small dimensions (small with respect to the distance between 
them) is also valid for the interaction of a small body with a large 
sphere. The distance, here, is understood to be measured between 
the centres of the bodies. 

-The law of universal gravitation for the case of the attraction 
of a body by the Earth can, therefore, be written in the form 


M 
PY ape 
where h is the height above the Earth’s surface and R is the radius 
of the Earth. For points close to the Barth's surface, k is so much 


smaller than R that R+% may be replaced by R. Then, F = am. 
Comparing this formula with the usual expession for weight, F = 
=mg, we see that the gravitational acceleration may be expressed 


in terms of the gravitational constant, the mass of the Earth, and 
the radius of the Earth: 


Since the gravitational 


z force is proportional to the masses, it 
is very large for heavenl 


i y bodies and negligibly small for the ele- 
mentary particles. In the interaction between atoms, molecules and 
gane particles of matter, the gravitational force is of no significance. 
oe ronga of attraction between the Moon and the Earth is 
ae aye between the Earth and a molecule of oxygen 

ae oe a between two oxygen molecules that are touching 
each other A = 3° X 4058 cm), ~2 x 10-37 G Pe AG 
speak for themselves. : seater eae Neure 
Electromagnetic Force. Tf two rti i i 

. particles or bodies hav 2ctric 

charges qı and qz, there is ies have electri 


cline e is a force of attraction between them if 
1e charges are of opposite sign and a force of repulsion if the charges 


—— T 


_—- 
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are of equal sign. Quantitatively this relationship is expressed by 


’ q2 . . 
Coulomb’s law: F=“, As in the case of universal gravitation, 


this formula is valid for small particles. We shall show in 
Sec. 111 that magnetic force and electric force are intimately related. 
All electromagnetic interaction is of a single nature. 5 

The interaction between atoms, intermolecular forces, and the 
forces holding electrons about an atomic nucleus are all forces of 
electrical origin. In order to again demonstrate the negligible nature 
of the gravitational interaction between elementary particles, we 
compare gravitational attraction with the electric attraction 
between a hydrogen atom’s nucleus and its single electron: 


Fa=9x 10-3 dyne, while Fyrav= 4 X 10-7? dyne! 


At first glance, it may not appear understandable why the inter- 
action between neutral atoms and molecules is of electrical origin. 
We shall go into this in more detail in Chapter 29. However, we 
should note here that the forces between the atoms and molecules 
do not depend on the overall charge of the particles (which is equal 
to zero), but on the local concentration of electric charge. 

Since intermolecular force is of electrical origin, surface tension 
and all cohesion forces between bodies are of the same origin. 
Frictional force is also essentially based on electric interaction. 

The elastic force that is developed when rubber or a compressed 
metal spring is extended is due to interatomic and intermolecular 
interaction. 

Thus, it too, in the final analysis, is electromagnetic in na- 
ture. 

Nuclear Foree. There are forces between neutral particles in an 
atomic nucleus (also between a neutron and a proton and between 
two protons) that cannot be explained on the basis of electromag- 
netism. These forces decrease very rapidly with increasing distance 
between interacting particles. As a result, these forces do no exist 
beyond the bounds of nuclei and are evident only in connection 
with phenomena involving direct interaction of nuclei. 

Foree Field. The space in which gravitational force is effective 
is called a gravitational field. Similarly, we speak of an electro- 
magnetic field. Any particle acted on by a force field can also create 
such a field. Thus, every particle creates a gravitational field and 
is acted on by gravity; and every electrically charged particle 
creates an electromagnetic field and is acted on by an electromag- 
netic field. s 3 . s Š 

Thus, every interaction of particles is depicted in physics accord- 
ing to the scheme: particle—field— particle. The iay particle 
creates a field, and this field acts on the second particle. 


ee 
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The field created by particles is material. The properties of 
a field are essentially different from the properties of substance. As 
a result, it is often said today that matter has two forms—field and 
substance. The problems of the interrelation of field and substance 
are at present under intense investigation and cannot, as yet, be 
considered solved (see p. 555 for a more detailed discussion) 


3. The Fundamental Law of Mechanics 


Newton’s Laws. The fundamental law of mechanics is the rela- 
tionship found by Newton between the forces acling on a body and 
the acceleration acquired by the body under the action of these 
forces. This law is usually formulated for particles. This in no way 
limits the universality of the law, inasmuch as a complex body 
can be considered, in principle, as the sum total of all its particles. 
Moreover, Newton’s equation has extraordinarily broad direct 
application, since in most problems in mechanics we are either 
concerned with bodies having small dimensions or are interested 
only in the motion of the body’s centre of mass. 

The fundamental law of mechanics states the following. If the 
forces fa, f2, f3, etc., whose sum total is F= Xf 
the acceleration acquired by the body is equ 
tained by dividing the result 


» act on a body, 
al to the quotient ob- 
ant force by the mass of the particle: 
F ~ 
t= —. 
m 
The equation also states that the acceleration vector coincides with 
the direction of the resultant force. The constant of proportionality 
in this formula is assumed to be equal to unity, which, the student 
will recall from his earlier training, depends on the choice of the 
system of units for the quantities entering into this equation. 
The fundamental law of mechanics may also be written in the form 


> 


or TREO The latter equation is equivalent to the former only 
if the mass does not change during the motion. We shall adhere 
to this condition. The case of variable mass will be considered 
below. In Chapter 3, we shall briefly discuss the equation of motion 
for bodies of variable mass in the range typical for rockets and, 
ot napter 24, we shall consider the complications arising when 
moves with a spe v i ight i 

y ie oe $ PTE approaching that of light (mechanics 

The fundamental law of mechanics sho 
generalising observed facts. This equati 
derived from any simple ge 


uld be considered as a law 


‘tion cannot be theoretically 
neral considerations. 
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The law ‘of inertia follows directly from the fundamental law. 
Jf there are no forces acting on the body, the acceleration is equal 
to zero and the motion of the body is rectilinear and uniform. 

In applying Newton's fundamental law to a particular body, we 
focus our attention on this body and consider the forces acting on it. 
It should not be forgotten, however, that force is a measure of the 
interaction between bodies and that one-sided interaction does 
not exist. If one body acts on another, the latter also acts on the 
former. The measurement of force is equivalent to the measurement 
of interaction. Thus, the very method of measuring force assumes 
that the force of one body acting on another and the force exerted 
by the latter on the former are equivalent in magnitude. Since we 
are usually interested in one particular body, we focus our atten- 
tion on the force acting on it; the other force is called the force of 
counteraction or the force of reaction. The forces of action and 
reaction are equal in magnitude but are oppositely directed. This 
proposition has become known as Newton’s third law of motion. 

Relativity of Motion. A body at rest in one system of coordinates 
may appear to us, from another viewpoint, to be moving. The 
uniform motion of a person walking along the platform of a station 
will appear nonuniform if described in a system of coordinates 
based on a braked train. Therefore, when speaking about the law of 
motion, the frame of reference for which this law holds must be 
indicated. The system for which Newton’s laws are valid must, 
without fail, satisfy the following conditions: a body on which no 
forces are acting must move rectilinearly and uniformly or must be 
at rest. Such a. system is called an inertial system. 

Thus, it is evident that all frames of reference that are executing 
accelerated motion with respect to a body on which no forces are 
acling are not inertial systems. Another important conclusion that 
immediately follows is that there is not merely one inertial system. 
In fact, an infinite number of inertial systems exist. An inertial 
system can be based on any body moving uniformly and rectilinear- 
ly with respect to some particular body on which no forces are act- 
ing. 
Let us assume that an inertial system has been selected. Newton’s 
law, F = ma, is valid for any body moving in this system with 
a velocity v and acceleration @. Now, let us consider another frame 
of reference moving rectilinearly and uniformly wilh a velocity a with 
respect to the inertial system. To be sure, in this system, the same 
body will have a different velocity, equal to the difference between 
the velocity « and the velocity zw, the motion of the second system 
with respect: to the first. However, since the relative motion of these 
two systems is rectilinear and uniform, the acceleration of the 
body will be the same in both systems. Expressed mathematically, 


a, 
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since acceleration is the derivative of velocity and the derivative of 
3 ? du 
a constant quantity is equal to zero a= 9): 


dv d(v—w) 

B “dt dt E 

The acceleration of a body enters into Newton’s law, but the 
velocity does not. As a result, the fundamental law of mechanics 
is exactly the same in both systems. 

This important proposition, following from Newton’s law of 
mechanics, is called the principle of the relativity of motion. It can 
be summarised as follows: An infinite number of inertial systems 
exist, and, in such systems, the law of inertia and the F = ma 
law are satisfied. In this respect, none of these systems has any 
special advantage over the other systems. All inertial systems are 
equally suitable for the description of physical phenomena. 

The principle of relativity was first formulated by Galileo. 

Laws of Mechanics in a Noninertial System of Coordinates. Let us 
assume that the statement “acceleration is due to forces” 
valid in every system of coordinates. In noninertial systems of 
coordinates, a body executes accelerated motion even when it is not 
interacting with other bodies. But if this ‘is so, 
possess, in addition to forces due to interaction 
origin, i.e., forces resulting from the noninerti 
system. These additional forces are calle 
it would actually be more correct to c 
Since inertial forces do not result fror 
satisfy Newton’s third law of motion. 


We shall limit ourselyes to a simple example of inertial force, 
for in this book we do not intend to employ noninertial systems 
of coordinates in the analysis of motion. 

et us assume that, for certain reasons, it is cony 
a system of coordinates moving with an 


a constant magnitude and direction. 
uniformly wi 


eration —q@ 


is always 


noninertial systems 
, forces of different 
al character of the 
d inertial forces (although 
all them noninertial forces). 
m interaction, they do not 


enient to select 
acceleration a, having 
All bodies at rest or moving 
th respect to inertial systems will move with an accel- 
—@ in relation to the noninertial system selected. The 
acceleration —a@ is produced by a force —ma. 

This is the inertial force for the case under consideration. It is 


not the result of the interaction of bodies, but is due to the accelerat- 
ed motion of the reference system. 


If the body under consideration in the noninertial reference 
system interacts with other bodies, 


y the inertial force i ed to 
the forces due to interaction. Aoig add 
The fundamental law of mechanics j i i 
> amen } n noninertial s of 
coordinates is written in the form: peta 


ma = F + inertial forces, 


| 
| 
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where F is the resultant force due to the interaction of the 
bodies. 

The expression for the inertial forces will vary in accordance 
with the nature of the motion of the noninertial reference system 
(rectilinear, circular, circular with accelerated speed, ete.).. Formu- 
las for inertial forces in a variely of cases can be found in books 
on theoretical physics. 


4. Application of the Fundamental Law of 
Mechanics to Accelerated Rectilinear Motion 


In this section, we give several elementary examples illustrating 
the physical meaning of the fundamental law of mechanics, which 
states ihat the vector sum of the forces acting on a body is equal to 
the product of the mass of the body and the acceleration, and its 
direction is that of the acceleration. 

Horizontal Motion Under the Action of a Constant Force. An 
engine moves a trolley located on rails. Two forces act on the 
trolley in opposite directions—F,1, the frictional force exerted by 
the rails, and Fe, the force exerted by the engine. If these two 
forces are equal, the trolley moves uniformly. In order for the 
trolley to accelerate, the resultant force must be directed parallel 
to a. Therefore, to produce accelerated motion, the motive force 
must be greater than the frictional force. Moreover, the difference 
between these forces is the resultant force, which according to the 
fundamental law of mechanics is equal to the product of the mass 
and the acceleration. Thus, 


Fo —Fyt=ma. 


The frictional force is the result of the interaction of the rails 
with the trolley. Therefore, coupled with F, is the force exerted 
on the rails (7;,). Similarly, coupled with Fer is Fie, the force which 
the trolley exerts on the engine. 

The force Fi is the resistance force overcome by the engine 
(experienced by and acting on the latter). This is the force that 
would act on a man’s muscles if he were the source of motive 
power. As can be seen, the resistance force Fte consists of two terms: 
the frictional force and the quantity—ma, which can be called the 
inertial resistance. Inertial resistance is always related to the effective 
force acting on the accelerated body. It is equal to ma in magnitude 
but is oppositely directed to it. The inertial resistance could also 
be a single force acting on the accelerated body, which would be the 


case here if there were no friction. i A 
hat ua consider another example of horizontal motion under 
aero of a constant force. The load under consideration is placed 
ion of a cons 
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on a moving trolley having a back stop (Fig. 7). If there were no 
back stop, the load could slip off the trolley when the motion is 
accelerated. With no back stop, the fate of the load depends on the 
interaction between the trolley’s floor and the load. This interaction 
involves only friction. The trolley moves with a small acceleration @ 
and the force acting on the load, i.e., the frictional force, should be 
equal to ma*. But the static frictional force cannot have any magni- 


tude -whatsoevers It must be somewhat less than Pe Bhi 
ma > FF*, 


motion with acceleration æ becomes impossible and the load slides 
off the trolley. If there were no friction between the load and the 


ia 


Fig. 7 


floor of the trolley, the load would not mov 


e from its place, i.e., 
the trolley would move 


out from under the load. Let us now assume 
that the trolley has a back stop. The load is then prevented from 
sliding as soon as it comes into contact with the back stop. The 
back stop now pulls the load with the force F — ma. The force 
coupled with the motive force is the inertial resistance experienced 
by the back Stop. It is also equal to ma, but is directed oppositely 
to the acceleration and acts on the back stop. 

Examples of force. The force accelerating 
ton is the force imp 


a passenger car is ~200 kg = 


= 1,960 newtons (1 new arting an acceleration of 1 metre/sec? 
to a mass of 4 kg; 1 newton — 10° dynes = 


bythe ie A = 0.102 kg). The thrust developed 
by he jet engine of a modern aircraft is 10,000-20,000 kg = 105-2 x 105 new- 
ons and the tractive force developed by a T9-3 diesel locomotive is ~10,000 kg. 
Vertical Motion of an Elevator. Let us consider the forces acting 


pee ees located on the floor of an elevator executing nonuniform 


ae 


* In accelerated motion, 


Ba if some body is carried al ¥ 
friction (the body carried along is at rest with respect semi ense o 


30 froti z to th ier stat- 
ic frictional force will always have the direction of the E at 
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Assume the elevator is accelerated upward (Fig. 8). Two forces 
act on the load: the Earth’s force, Fr, and the force exerted by the 
elevator’s floor, Fa. But now the resultant force must be different 
from zero, so fy, Fa. Since the resultant force will be in the 
direction of the acceleration, Fe > Fy, and 


Foy — F y= ma. 


The force Fm is simply the gravitational 
force exerted by the Earth on the load. Thus, 


Fea — mg=ma. 


The magnitude of the force exerted by the 
load on the elevator, Fze, is exactly the same 
as Fa; thus, the resistance experienced 
by the elevator in lifting the load is 


ma 


Fe =mg + ma. Centre of gravity 
We see that this resistance consists of the 
weight of the load and the inertial resist- 
ance. The force Fre is sometimes called the 
apparent weight. 

The result obtained is for the case when 
the acceleration of the elevator is directed 
oppositely to that of gravity. This condi- Fig. 8 
tion is satisfied not only when the eleva- 
tor is being accelerated upward, but also when it is decelerated 
during its downward motion. 

When the direction of the gravitational force and the acceleration 
of the elevator coincide, the force exerted by the load on the ele- 
vator (apparent weight) is 

Fig=mg — ma. 


From this formula, it is evident that the pressure against the floor 
of the elevator ceases when 4 = g, i.e., when the elevator falls 
freely in the gravitational field. In this case, the body in the falling 
elevator ceases to press against the floor and stretch the cable, 
the body, so to speak, ceases to have weight. 

Freely Hanging Load. Let us consider the motion of 
ended from a trolley executing accelerated motion. 
the string by which the plumb bob is suspended 
vertical. Two forces act on the load: Fg, 
the tension on, the string, and Fr, the Earth’s attraction, which 
is equal to mg (Fig. 9). These forces are directed at an angle to 
each other. According to the fundamental law of mechanics, their 


31409 


i.e., 

Force on a 
a plumb bob susp 
For such motion, 
forms an angle with the 
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vector sum is equal to ma and its direction is that of the accelera- 
tion. The diagonal of the parallelogram formed by the forces Ps: 
and Fz, is, therefore, horizontal: 


ma = Fa + Fri. 


The force coupled with Fp: is exerted on the Earth and does not 
interest us. We are, however, interested in the force Frs, i.e., the 
force with which the load 
stretches the string. The value 
of this force, exerted on the 
string, is: 

Fs = —ma+ Fy. 


Thus, in this example too, 
the inertial resistance is a com- 
ponent part of the total re- 
sistance experienced by the 
accelerated body. 


5. Application of the 
Fundamental Law 
of Mechanics to Circular 
Motion 


Motion in a circle is accel- 
erated motion. If a body 
3 moves in a circle with constant 
Fig. 9 angular velocity, the magni- 
tude of its acceleration is equal 
ong the radius. 
in a circle, the body may be 
on of any number of arbitrarily directed forces. 
i ndamental law of mechanics, 
mply, the resultant force, must 


c us (parallel to the acceleration) 
alue of its magnitude must þe 


F 


N resultant force actin. 
the centripetal force. We again em i 

i; orce. a phasise that the resultant force 
always has the direction of the acceleration and not of the velocity, 


to œR and its direction is inward al 

While executing uniform motion 
under the acti 
However, i 
the vector 
be directe 
and the v 


mv? 
centrip = ERA. = mo?R. 
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force is to continually deflect the body from the rectilinear path 
along which it would move, as the result of inertia, if this force 
were not present. 

Example. A particle of mass m caught on a blade of a modern steam turbine 
(3,000 rpm, radius about 1 metre) experiences a centripetal force F = mo*r = 
= m X (314)? X 100 = m X 107 dynes (where m is in grams). The weight 


mx 107 


of the particle is equal to mg. Thus, the centripetal force is , or about 


10,000 times, the weight of the particle. 


If a body is given accelerated motion, then, in conformity with 
the law of action and reaction, the accelerated body should act on 
other bodies (constraints) that make it 
accelerate rather than move in accordance 
with the law of inertia. The force of the 
accelerated body on the constraint has 


been called the inertial resistance. Such Ye 
a force also exists, of course, for circular Uj Yj 
rer 


Nass 


motion—it is called the centrifugal force. 

Centrifugal force and centripetal force 
are equal in magnitude but are opposite- 
ly directed. The centrifugal force is 
applied to the constraints of a body exe- | 
cuting circular motion or, in other words, 
is applied to those bodies making the Fig. 10 
body under consideration move in a circle, 
and preventing it from moving rectilinearly and uniformly. As in 
the case of centripetal force, centrifugal force is a resultant—the 
sum of all the reactions exerted by a rotating body on its con- 
straints. 

Let us consider several examples, limiting ourselves to the simple 
case of circular motion due to the interaction of two bodies. If 
body A prevents body B from moving rectilinearly and uniformly, 
and makes it move uniformly in a circle, F 4p is the centripetal force 
and Fy, is the centrifugal force. Such simple interaction occurs 
between a body located on a bowlshaped pedestal, rotating about its 
axis in the horizontal plane, and the pedestal itself (Fig. 40). If 
the frictional force is not very large and the pedestal is rotating 
rapidly, the body slides to the wall of the bowl. In this case, the 
interaction between the body and the pedestal consists in the 
following: the wall of the bowl acts on the body inwardly along 
the radius (centripetal force), and the body, with a force of equal 
magnitude, presses against the wall outwardly along the radius 
(centrifugal force). 

We return now to the initial moment in this experiment. The 
body is lying on the pedestal and the pedestal has just begun to. 
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rotate. If there were no interaction between the body and the 
pedestal, the body would remain in place and the pedestal would 
rotate under the body. The presence of static friction prevents 
this from happening. The body rotates together with the pedestal. 
Moreover, as was indicated in the previous section, the static fric- 
tional force will be directed inwardly along the radius. The static 
frictional force is the only force impelling the body to rotate, i.e., the 

frictional force in this 


ee a case is acentripetal force. 
yee SEN Therefore, 
4 , aa 
jA F jp = Feentrip: 
: = 
The centrifugal force is 
5 ~S 


exerted by the body on 
the pedestal and is thus 
directed outwardly along 
the radius. If for the 
purpose of clarity (it 
should be remembered, 
however, that this is a 
very gross picture) friction is conceived as being due to the engage- 
ment of two rough surfaces, whereby the surface protuberances 
of the first body mesh with those of the second, then the cen- 
trifugal force constitutes a force acting outwardly along the radius 
at the meshed points of the pedestal surface. 

The frictional interaction maintaining the body fixed with 


respect to the pedestal must be less than a certain maximum F7", 
In increasing the velocity of rotation of the bowl, the value of 
mo°R finally becomes greater than F7"*, which makes it impossible 
for the body to execute circular motion with acceleration |@|= 
= oR. Indeed, to secure circular motion with angular velocity ©, 
a force mo? must act on the body. If the frictional interaction cannot’ 
provide this force and, hence, motion in a circle of radius R with 
angular velocity ©, the body moves with respect to the pedestal 
and static frictional interaction ceases to exist between the body 
and the pedestal. ; 

As soon as interaction between the body 
and the body becomes free, rectilinear and 
the velocity being th 


Fig. 14 


and the pedestal ceases 
uniform motion begins, 


E AN at possessed by the body at the moment of 
dissociation. Since the velocity of a body moving in a circle is 


directed along the tangent, this line represents the line of motion of 
the freely moving body. The tangential path of particles dissociated 
from a rotating body is very clearly demonstrated in the case of 
particles flying from a rotating grindstone. - 
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Let us now consider the rotation of a stone tied to the end of 
a string (Fig. 11). In order to uniformly rotate the stone at the end 
of the string under normal conditions, tangential acceleration as 
well as centripetal acceleration must be imparted to the body. The 
tangential acceleration is necessary in order to overcome the fric- 
tion with the air. The resultant acceleration—and, hence, the 
foree—is not directed along the radius, but forms an acute angle 
with the direction of motion. The hand executes rotational motion 
and the string is directed at 
each instant along the tan- 
gent to the circle described 
by the hand. 

As another example of 
circular motion, let us con- 
sider the rotation of two 
attracting bodies having the 
same angular velocity about 
a common centre. By means 
of a centrifugal machine, it 
is not difficult to make two 
bodies of equal mass, 
joined by a string, revolve 
about a common axis. 

To begin with, let us con- 
sider the first body, with a 
string attached to the rotating shaft. The centrifugal force acting 
on the shaft is equal to m ost. Similarly, the second body acts 
on theshaft with aforce m2@*R2. If these forces are equal, the strings 
could be joined to each other as shown in Fig. 42, for nothing 
would change thereby. It is thus clear that the condition for stable 
rotational motion of two bodies joined by means of a string is the 
equality of the centrifugal forces exerted on the string by these 
bodies: 


mo? Ry = m0" Ro. 
Thus, ` 


mo SF Ry? 

i.e., stable rotation takes place only when the ratio of the distances 
to the axis of rotation is inversely proportional to the masses of 
the bodies. Z 

The point dividing the distance R- R> in the ratio Re = oF 
(Fig. 12) is called the centre of mass (see Sec. 15). It can be stated 
that stable rotation of two joined bodies takes place about the 
centre of mass of the system. 
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We have spoken about two bodies whose interaction is achieved 
by means of a string. However, the above is also completely valid 
when two bodies attract each other in accordance with the law of 
universal gravitation, or when a positive anda negative charge 
attract each other. Thus, interaction of any kind between two 
attracting bodies can produce stable rotation about the centre of 
mass of the system. This interaction is given by two forces applied 
to the attracting bodies. The forces are oppositely directed but 
numerically equal. (At this point, the unsophisticated reader will 
usually ask: Why do the bodies not attract each other? We repeat: 
The forces are parallel to the accelerations, not to the velocities, 
and in circular motion the acceleration is directed along the radius 
toward the centre of rotation.) Since a single force acts on each 
body, both are centripetal forces. At the same-time, both are also 
centrifugal forces. Thus, body A acts as a constraint for body B, 
and vice versa. In other words, for body A, Fpa is a centripetal 
force while F4, is a centrifugal force, and vice versa for body B. 
However, the concept of centrifugal force is used here in a complete- 
ly formal sense. It was introduced only in order to emphasise the 
similarity existing between a system of spheres joined by a string 
and a system of bodies “joined” by a force of attraction. 

A planetary system is an example of stable rotation of attracting 
bodies. Let us assume that the Sun had only one planet—the Earth. 
The centre of rotation would then divide the line joining the Sun 
and the Earth in the ratio mgun : mgartn = 330,000 : 1. 

Thus, when it is ordinarily said that the Earth rotates about the 
Sun, we are not committing a serious error, and this would be so 
even if the Earth were the Sun’s only planet. 
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The motion of the Earth is complex. It revolves about its axis 
and, at the same time, moves in an orbit about the Sun. Hence, it is 
clear that the Earth does not constitute an inertial frame of ref- 
erence. Nevertheless, under conditions prevailing on the Earth, 
Newton’s law is generally quite satisfactory. In a number of cases, 
however, the noninertial property of the Harth’s frame of reference 
has an appreciable effect on the phenomena being studied. These 
cases should be investigated. 

Effect of the Earth’s Rotation on Its Form. Weight of a Body. 
If the Earth’s rotation is not taken into consideration, a body lying 
on the surface of the Earth can be considered at rest. The sum of 
the forces acting on this body would then be equal to zero. As a mat- 
ter of fact, any particle on the Earth’s surface lying at latitude @ 
moves with an angular velocity œ = 0.7292 x 10-4 sec"! about 
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the globe’s axis, i.e., in a circle of radius r = R cos ọ (R is the 
radius of the Earth, which is assumed to have, to a first approxima- 
tion, the shape of a sphere). Therefore, the sum of the forces acting 
on such a particle differs from zero. It is equal to the product of the 
mass and the acceleration, œR cos p, and is directed along r. 

It is clear that the presence of such a resultant force OG 
(Fig. 13) is possible only when the reaction of the Earth’s surface 
OA and thegravitational force 
OE are directed at an an- 
gle to each other. The body 
will then press on the Earth’s 
surface (according to New- 
ton’s third law) with a force 
OC = —OA. If the globe 
were at rest, this force would 
be equal to the gravitational 
force OE and would also 
coincide with its direction. 

Let us resolve the force OC 
into two components — one 
force directed along the ra- 
dius OD and the other along 
the tangent OB. As can be 
seen from the figure, the 
Earth’s rotation results in 
two effects. First, the weight 
(the body’s pressure on the 
Earth) becomes less than the 
gravitational force. Since OC ~ OD, this decrease equals DE = 
> mRw? cos*p. Secondly, a force is produced tending to flatten 
the Earth, i.e., shift matter toward the equator. This force 
OB = mRo? cosp sing. Such flattening has actually taken 
place, for the Barth’s shape is not spherical but close to an ellipsoid 
of revolution. As a result of this effect, the equatorial radius of the 
Earth is 1/300 greater than the polar radius. 

The flattening force tended to redistribute the mass of the globe 
as long as the latter’s form was not in a state of equilibrium. When 
this process was completed, the flattening force evidently ceased 
to be effective. Hence, the force exerted on the terrestrial “globe” 
is directed normal to the surface. 

Let us now return to the quantity expressing the pressure of the 
body on the Earth, i.e., to the physical quantity generally called 
weight. The calculation made for a sphere (gravitational force minus 
mR? cos? ọ) is naturally not valid for the actual shape of the Earth. 
However, for approximate calculations, this value can be used. 


v 
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At the poles (p = 90°), the weight of the body is equal to the 
gravitational force. Let us denote the gravitational force of the 
body at the poles by mg. As indicated above, the pressure exerted 
by the body on the Earth’s surface at any point of the globe, in 
other words, the weight of the body, will be equal to the difference 
between the gravitational force and the force DE, i.e., 


mg — mRo? cos? ¢ = mg’. 
Thus, 
g’ =g — Ro? cos? p 
is the acceleration with which a body falls at the latitude p. At the 
equator, g’ is 1/300 less than g. 
If we use an appropriate value for the acceleration of a freely 
falling body at each of the various latitudes, we do not have to 
calculate the effect of the 
Earth’s rotation on the weight 
of the body. i 
Effect of the Earth’s Rota- 
@ tion on the Motion of a Body 
on the Earth’s Surface. Let us 
assume that the motion ofa 
body is observed in a rotating 
system of coordinates. The 
body moves rectilinearly and 
uniformly past the observer, 
and the motion is curvilinear 


V in the selected noninertial 
frame of reference. Coriolis, 

-e the French scientist, showed 
cor by means of calculations that 


relative to a system rotating 

with angular velocity a 

Fig. 44 body moving rectilinearly and 
uniformly with velocity vhasan 

; acceleration equal to 2vo sin a, 

where œ is the angle between the axis of rotation and the direction 
of the rectilinear motion. The acceleration is directed perpendicular 
to the plane passing through the axis of rotation and the direction 
of the velocity. We may use the following rule to determine which 
of the two possible directions is that of the acceleration. If one looks 
along the axis of rotation in the direction that makes the rotation 
appear counterclockwise and places his left hand palm down with 
the fingers pointing in the direction of the rectilinear motion, the 
thumb will point in the direction of the acceleration (Fig. 14). 
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The Coriolis acceleration &cor acts on all bodies moving on the 
Earth’s surface. If one looks along the Earth’s axis from the North 
Pole, the rotation appears counterclockwise. Hence, in the Northern 
Hemisphere, any body moving rectilinearly relative to an inertial 
system will deviate to the right (as viewed by a terrestrial observer) 
in the course of its motion, while in the Southern Hemisphere, it 
will deviate to the left. This deviation could be large or small, 
depending on the direction of motion with respect to the axis and 
on the linear velocity of the motion. 

The deviation of the body can take place in the horizontal or in 
the vertical plane (with respect to the surface of the Earth). The 
Coriolis acceleration is directed perpendicular to the Earth’s axis; 
hence, the deviation taking place in the horizontal plane is greatest 
at the poles and equal to zero at the equator. The reverse is true 
for deviations in the vertical plane. The deviations in these two 
planes determine the corresponding projections of the acceleration 
vector. Thus, the projection of the acceleration of the body in the 


horizontal plane is 
2vo sin 9, 

where œ is the latitude. In the Northern Hemisphere, this projec- 
tion is directed to the right of the motion. k 

The deviation of bodies moving in the horizontal plane from 
their rectilinear path is the reason why the right banks of rivers 
are eroded in the Northern Hemisphere, and the left banks in the 
Southern Hemisphere. For the same reason, rivers in me Northern 
Hemisphere by-pass obstacles to the right and in the Southern 
Hemi ; ; eft. i 

e eee into regions of low pressure deviate from the 
radial direction to the right in the Northern Hemisphere (to the 
left, in the Southern Hemisphere) and form cyclones. gra ae 
in the Northern Hemisphere ae me fe ala counterclockwise, 
and i m Hemisphere clockwise. ; 

As PE erica deviation, a falling poe, agas ai 
fall exactly vertically. Such a body deviates Trom enst o weet ( ig 
Earth rotates from west to east, i.e., counterclockwise I viewe 


from the North Pole). 


Examples. 1. Let us calculate the maximum deviation oa aot ay 

shell, Tho deviation will be a maximum at the poles ( T ead ee ing 
irections @ = 90°). If we take the velocity of the are fo aa Sh 
Obtain a deviation sof 2 x-4,000 % 0a 402 st 0-40 ae oan 
tational acceleration is about 70 times greater than is pelete tics " pe can 
e seen, the deviation of the shell from its regtilinear path ca 

4 2 Ue ta eee CN eon wir to south (in the Northern 

, iver is Ho 9 e 

E mein & S v = 3 km/hr. Thus, the water moves from a region 


< 
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of small linear velocity of rotation of the Earth’s surface to a region of larger 
linear velocity. This increase in the velocity of motion (directed from west to 
east, together with the banks of the river) is determined by the Coriolis accel- 
eration and is due to the action of the right bank of the river on the mass of 
water. Let us calculate the Coriolis acceleration for the latitude p = 45°: 


aCor = WO9 Sin P, 
Oo = 2x rads/day =7.25% 10-5 rad/sec, v=3 km/hr=0.83 m/sec, 
@Cor = 2X83 7.25 10-5 0.707 = 850 x 10-5 = 8.5% 10-3 cm/sec?. 
Thus, on every ton of water the right bank exerts a force of 


410-3 
Sa AE xis gm=8.7 gm. 


The steep right banks of the Volga, Don and other big rivers of the Northern 
Hemisphere illustrate this effect. 


7. Data Necessary for the Solution of Problems in Mechanics 


The basic problem of mechanics is the determination of the motion 
for given forces. To determine the motion means to be able to 
indicate the location in space and the corresponding instant of time 
for any of the particles. If we are concerned with a complex mechan- 
ical system, then such data are necessary for each of the particles 
into which this system can be considered divided. 

In order to tackle such a problem, we must, in the first place, 
have complete data on the effective forces. The forces must be known 
for every particle and for every location of this particle. If these 
forces are known, then by means of Newton's equations we can 
determine the acceleration of the particle. However, Newton’s equa- 
tions of motion alone are insufficient to completely determine the 
path, the velocity and the instant of time corresponding to the 
passage through a given point in space. To describe the motion, 
it is necessary to know for each instant of time the location of the 
particle and the magnitude and direction of the velocity. In all, 
Six quantities must be given: three coordinates and the three projec- 
tions of the velocity on the axes. These data uniquely describe the 


ee state” of a particle and may be called the parameters 
of state. 


Thus, the problem reduces 


to the determination of the parame- 
ters of state, for 


Newton’s equations only give the acceleration. 
To solve the problem, the initial conditions must be known, i.e., 
the values of the parameters of state for some instant of time (this 
instant is usually designated by t = 0, whence the designation “ini- 
tial conditions”). If the initial values of the parameters of state 
are known, the rest is merely a matter of mathematics. Newton’s 
equations of motion plus the initial data suffice to uniquely solve 


Se 
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the mechanical problem. In principle, the future motion of the 
particle as well as the past motion can be established for any desired 
period of time. This concept amazed scientists at one time. Laplace, 
the great French scientist and thinker, once said: If we knew 
the initial coordinates and velocities of all the particles comprising 
the world, we would be able to predict the fate of the world. This 
somewhat naive viewpoint, reducing all reality to purely mechanical 
phenomena, is not valid in principle, and not merely because it is 
practically impossible to obtain the required data. Mechanics, based 
on Newton’s laws, has limited application and its conclusions cannot 
be applied that broadly. 

Let us return, however, to the six initial conditions. The need 
for giving precisely six quantities for a particle is evident from 
Newton’s equations themselves. 

The vector equation can be resolved into its components and 
written in the form of three equations: ma, = Fx, ma, = Fy and 
ma, = F. To determine the motion, it is necessary to establish how 
the particle’s three coordinates x, y, z vary with time. In order to 
establish the dependence of the coordinate 2 on time, we must 
integrate the equation 


dvx 
i Pixs 


The first integration enables us to find the z-component of the 
velocity. Upon integrating, we obtain the first constant of integra- 
tion. The second integration enables us to find the coordinate x as 
a function of time, and the second arbitrary constant is obtained. 
The above also holds true for the equations of change with respect 
to time for the other two coordinates. In all, six arbitrary constants 
are obtained. These may be determined only if six independent facts 
about the coordinates and velocities of the particle are known. 

As we have indicated, the initial conditions consist of the three 
initial coordinates and the three projections of the initial velocity. 
However, the problem could also be solved if six other quantities 
are known. For example, we may be given the three coordinates 
of the initial point, the numerical value of the initial velocity, 
and two coordinates of the final point. The path of the particle 
is also uniquely determined by these six conditions. 

The parameters of the particle may be given in a variety of 
ways. The location of the particle in space may be given by three 
Cartesian coordinates or by the distance from the origin of the coor- 
dinate system and two angles formed by the radius vector with 
the axes. Similarly for the velocity. 

A typical example of the dependence of a body’s motion on the 
initial conditions is the behaviour of a rocket fired from the sur- 
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face of the Earth. The trajectory of the rocket and its destiny is 
determined by the firing direction, the geographical Tocation of the 
launching site and the magnitude of the initial velocity. As is well 
known, for small firing velocities from the Earth, a body has a para- 
bolic trajectory. For a velocity of about 8 km/sec, equilibrium 
is achieved between the centrifugal force and the gravitational 
force, and the launched body may be placed in a circular orbit. 
For velocities between 8 and 11.2 km/sec, the launched body moves 
in an elliptical orbit about the Earth. At an initial velocity of 
about 11.2 km/sec, the kinetic energy of the body becomes sufficient 
to completely overcome the Earth’s gravitational attraction. 
A rocket launched with such a velovity will have a hyperbolic 
trajectory. 

If the mechanical system consists of n independent points, the 
number of parameters for the system will be equal to 6n. 

In some cases, however, constraints which serve to decrease this 
number may be placed on the mechanical system. A simple example 
is a centrifugal regulator, which may be considered as a system 
consisting of two joined spheres that can slide apart and turn about 
a common axis. It is clear that, given the distance of a point from 
the axis of rotation and the azimuthal angle with respect to an 
arbitrary line, we can uniquely determine the mechanical state of 
the system. Two “coordinates” and two velocities of change of these 
coordinates constitute the parameters of this state. 

Let us now consider an arbitrarily rotating solid body and deter- 
mine the data required to fix its position with respect to a station- 
ary system of coordinates. It is clear that the centre of mass of 
the body is determined by three quantities. To describe the body’s 
rotation, three angles suffice. We need not elaborate on this point, 
for it is evident that by means of three rotations about mutually 
perpendicular axes any desired orientation of a body can be achieved. 

Thus, the solid body requires twelve parameters—six coordinates 
and six velocities of change of these coordinates. 

As another example, let us consider two rigidly joined points. 
If they were free, six coordinates would be required to describe 


them, Since they are rigidly joined, an additional condition relating 
the coordinates of these points exists, namely: 


(z1— z)? + (yy — yo)? + (2, — 2»)? = const. 


Thus, five independent quantities are required to describe this system. 
In all, there are ten parameters—five coordinates and five veloci- 
ties of change of these coordinates. 


y; Since the parameters of state are always equally divided between 
coordinates” and velocities of change of the “coordinates”, it is 
customary to speak of the degrees of freedom of a system, whereby 
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we mean the number of independent coordinates required to describe 
the system. Thus, one point has three degrees of freedom, two 
rigidly joined points have five degrees of freedom, a solid body— 
six degrees of freedom, a system consisting of n independent points— 
3n degrees of freedom, etc. The meaning of the following proposition 
should now be clear: The mechanical state of a system is described 
by giving its parameters in terms of the number of degrees of freedom. 


8. Constants of Proportionality and Dimensions 
of Physical Quantities 


The coefficient y in the expression for the law of universal gravi- 
tation is a universal constant depending on the choice of units for 
force, mass and distance. It is possible to choose the units in such 
a manner that y = 1. This would require that the unit of mass be 
equal to the mass of a particle attracting a similar mass at unit 
distance with unit force. In the CGS system of units, such a mass 
would be equal to 1.5 X 107 gm, i.e., 15 tons. 

Thus, universal constants in formulas of physics depend on the 
specific choice of units. If we desired, we could eliminate all such 
constants from formulas of physics by appropriately choosing the 
units. 

It is important to grasp the concept that the employed system 
of units and the constants of proportionality in formulas are inter- 
connected. We can demonstrate this interconnection by dimensional 
formulas. First, the number of units that we wish to consider funda- 
mental must be established. This number depends entirely on us 
and is determined exclusively by considerations of convenience. 

A widely used system of units in physics is based on the units 
of length (Z), mass (M) and time (T) as the independent quantities. 
The values of all universal constants and the units of measurement 
of all other quantities are then uniquely determined by the choice 
of units for L, M and T. The nature of this relationship is given 
by so-called dimensional formulas. Several examples will make their 
meaning clear. The dimensions of velocity are L7-1, acceleration— 
TT force—MLT~, the gravitational constant—M—L°7-", elec- 
tric charge in the formula for Coulomb’s law—M'2L°/27-1, etc. 
Knowing these formulas, we can immediately say how the numerical 
values of the universal constants and the units of derived physical 
quantities vary when the magnitude of some fundamental quantity 
is changed. 3 A 

As we shall see by examples in Sec. 81, dimensional analysis of 
physical quantities can be used to predict the nature of some depend- 


ence or other between physical quantities. 
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In addition to the system based on distance, time and mass, 
a system in which the fundamental quantities are distance (L), 
time (T) and force (F) is widely used. This is known as the FLT 
system. Naturally, the dimensional formulas in this system will not 
always be the same as above. Thus, moment of force in the FLT 
system has the dimensions FL, and in the MLT system the dimen- 
sions ML?’ T. Mass, being a derived quantity in the FLT system, 
has the dimensions FL-17?. : 

The fundamental law of mechanics relates the quantities of force, 
mass, distance and time. Therefore, the value of the constant of 
proportionality in this formula depends in both systems on the 
choice of units. In both systems, a constant of proportionality 
equal to unity is assumed. This means that in the MLT system, 
using the formula F = ma, the unit of force is chosen so that F = 41 
when the mass and the acceleration are equal to unity. In the FLT 


p F k à 
system, using the formula m = =n the unit of mass is chosen so 


that m = 4 when the force and the acceleration are equal to unity. 

In this book, we shall for the most part use two variants of the 
MLT system: 

CGS system: L—centimetre, M—gram, T—second; 

MKS system: L—metre, M—kilogram, T—second. 

In the CGS system, the unit of force is the dyne = 1 gm x cm/sec? 
and the unit of work is the erg = dyne X cm. In the MKS system, 
the unit of force is the newton = 4 kg X m/sec? and the unit of 
work is the joule = newton X metre. 

If the reader is confronted with data expressed in the FLT sys- 
tem, these data should be converted into one of the indicated 
systems. To do this, he only need recall that a unit of force in the 
FLT system is a kilogram (the weight of a kilogram mass at sea 
level at 45° latitude), which is related to the two units of force 


adopted by us as follows: 
1 kg=9.81 newtons—9.81 x 105 dynes. 


We shall return to the subject of systems of units when we con- 
sider electrical quantities. 
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CHAPTER Il 
MECHANICAL ENERGY 


9. Work 


Motion without acceleration (i.e., rectilinear and uniform) may 
take place either without the action or with the action of forces on 
the body. In the latter case, the sum of the forces acting on the 
body is equal to zero. There is an essential difference between these 
two kinds of motion. In the first case, motion is not accompanied 
by work, while to achieve the second type of motion work must be 
expended. A motor works, moving an automobile uniformly and 
rectilinearly. A man works, 
moving a sleigh with its load 
uniformly and rectilinearly. 
We say in these cases that p F 
work is expended in overcom- 
ing resistance—friction, air 
resistance, etc. 

Of the two balanced forces 
acting on a body moving with- f 
out acceleration, one is di- a. 1 
rected along and the other op- GOEREE e 
posite to the direction of mo- 
tión. 

We say that a force acting 
along the direction of motion performs work. On the other hand, 
as regards a force directed opposite to the motion, we say that work 
is performed against this force. 

For quantitative evaluation, work is expressed as the product of 
the force acting on a body and the distance traversed by the body. 
The term work is the designation for this physical quantity. 

Let a body be acted upon by a number of forces whose vector 
sum is equal to zero. The body moves uniformly and rectilinearly. 
All the forces may then be resolved into four components (Fig. 15). 
Forces F, and F; in accordance with the adopted definition, perform 
no work. Force F performs work equal to FAS, where AS is the 
traversed distance. The work of force F’ is equal to —F AS, where the 
minus sign indicates that work is performed against the force 2’. 

Let us now consider the motion of a body with acceleration, i.e., 
curvilinear and nonuniform motion. As we already know, a resultant 
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Fig. 15 
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force acts on the body, in this case, that is directed along the 
acceleration (but not along the path in the general case!). Let us 
again resolve all the effective forces into forces directed along the 
motion and perpendicular to the motion (Fig. 16). Now, F is not 
equal to F’, and F, is not equal to F». Using the definition for 
work given above, we can assert, as before, that F, and F, perform 
no work. The work of force F’ is again negative, i.e., the work is 
performed against the force F’ and is equal to F’AS. Force F performs 
the work FAS, which is more 
than the work against the force 
of resistance. The surplus work 
serves to accelerate the body. 
The inequality between the 
forces Fy and F, shows that 
the motion-is curvilinear. Their 
difference F—F, corresponds 
to the normal component of the 


h acceleration. 
Camer . ; 
Direction of motion Let us consider an extreme 
case—uniform motion in a circle. 
Fig. 16 


The resultant force for such motion 
is directed, as we know, along 
the radius of the circle, i.e., perpendicular to the direction of mo- 
tion. Therefore, the centripetal force performs no work. 

Thus, the surplus work in the general case of curvilinear accelerat- 
ed motion is not used to produce acceleration in general, but only 
the tangential component of the acceleration. For a particle, this 
can be expressed as follows: 


F—F’=ma; and FAS —F’'AS = ma, AS. 


We repeat: (F — F’) is the tangential component of the resultant 
force FI“. 

_ The work expended in accelerating a body (being equal, by defini- 
tion, to the projection of the resultant force on the direction of 
motion multiplied by the traversed distance) is equal to the product 
of the mass of the body, the traversed distance, and the tangential 
acceleration. The last equation above can be written in the form 
FAS = F’AS+maAS and can be read as follows: The work 
performed by the effective force consists of the work against the 
force of resistance and the work expended in accelerating the body- 

Examples. 1. A jet peesuger plane, havi 


a height k = 10 km. If it moves unif. 
this height is ba 


ng a weight P = 70 tons, attains 
, the work performed in rising to 


A;=Ph=7xX108 ke-m= 68.6108 joules = 68.6> 1015 ergs. 
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If this height is attained over a path S = 85 km with a simultaneous increase 
in velocity (acceleration a = 0.3 m/sec*), the additional expenditure of work 
in producing acceleration will be 


Ay=maXS=17.9X 108 joules=17.9x10%5 ergs=1.82X 108 kg-m. 
2. To plane a board 2 metres long and 20 centimetres wide, a joiner expends 
about 150 kg-m of work. 
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Thus, in accelerating a body, the resultant force #' performs 
the work 


A=F{* x AS =ma,AS 
where a; is the average tangential acceleration over the portion of 
path AS being considered. Substituting for a, we obtain 


AvxXAS _ y 
=m- mw. x Av, 


average velocity which is equal to 1/3 (vz+v). 
antaneous velocities at the beginning 
then * 


where v is the 
Here v, and vz are the inst r e 
and the end of the path respectively. Since Av = v2 — vı, 
mv3 mv mv2 
amai ahaa), 


i.e., the work is numerically equal to the increment in the value 


of La Therefore, the quantity 
mv? 
K= ae 
is employed as a measure of the energy of motion of a particle. 
This quantity K will be called kinetic energy. The previous equation 
may now be read as follows: The work of the resultant force acting 
on a body (i.e., the product of the tangential component of the 
resultant force and the traversed distance) is equal to the increment 
in the kinetic energy of the body. This equation is convenient for 
the solution of elementary mechanical problems in which the path 


a ich the force acts is given. ; Oe 
awe walt repeatedly be dealing with the term “energy - It is one of 


the most important physical concepts. Energy, i.e., work capacity, 


is obtained if we write the expression for an infinitely 
sm fil one TEn in the form dA = pe dv ana integrate it, from the mo- 
= ches vz: 
ment the velocity was vı to the moment-it rea E 


to my A er 
A= f mv dv=— 7 Dey 
vi 
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is a function of the state of a body. Work is produced at the expense of 
a decrease in the value of this function. Kinetic energy is a function 
of the state of motion. If the kinetic energy changes from K4 to Ka, 
then the work performed thereby is equal to K.—K,, independent of 
the nature of the motion. It is of no importance whether the velocity 
changed rapidly or slowly, uniformly or nonuniformly. The decrease 
in the kinetic energy by a specific amount always yields the 
same amount of work. 

Only in the case when the physical quantity is a function of state 
can it have the sense of energy, i.e., a store of work. 


Examples. A unit of energy in atomic physics is the electron-volt (ev). This 
is the kinetic energy of an electron accelerated through a potential difference 
of 1 volt: 


1 ev=1.6x10-12 erg. 
The energy of a proton accelerated in a synchrotron is 10 Bey = 10! ey = 
= 0.016 erg. 
The kinetic energy of-a large jet passenger plane (m = 100 tons and v = 
= 800 km/hr) is 


2.5X 1018 ergs—2.6109 joules = 2.5% 108 kg-m. 


11. Potential Energy 


Let us consider several phenomena in which the performed work 
is: not accompanied by a change in the velocity of the body. We 
shall be concerned with two types of problems. The first is related 
to the elastic deformation of a body, while the second deals with 
events occurring during the motion of a body in a gravitational or 
electric field. We shall presently show that in both cases we shall 
be dealing with the transformation of work into a special variety 
of energy, so-called potential energy. 

Elastic deformation phenomena will be treated first. Experiments 
show that for any elastic deformation—extension, compression, 
flexure, etc.—one can always find a function of state that increases 
precisely by the magnitude of the work performed on the body. 
This function of state or, in other words, the function of the body’s 
properties and degree of deformation is called the potential energy 
of elasticity. 

_ We shall show this energy to exist for a case of elastic deforma- 
tion, namely, linear extension or compression. Analogous examples 
could be given for any other kind of elastic deformation. 

Let some force, such as a muscular force, stretch a solid body, 
e.g., a spring, very slowly. The work expended in stretching the 
body from length Z + s, to length Z + s2, where Z is the length of 
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the unstretched spring, is 
A= F (s2 — s;). 
The muscular force is balanced by the elastic force of the spring 


at every given moment. For a small extension of the spring, the 
latter is proportional to the deformation s*: 


Fe = ks. 
In the expression for work, we must use the average value of 
; 1 ; 
the force F, i.e., + (ks, + ksz). We then obtain**: 


ne ksi ksi _ (+ ) 
A= oil 


2 a 2 


i.e., the work against the elastic force is expended in increasing 
r ks? nye 3 f 
the quantity — This quantity is, therefore, adopted as a measure 


of the elastic energy. The quantity 
Uei = i 
will be called elastic potential energy. 

Elastic potential energy formulas for other kinds of deformation 
have exactly the same form. The body’s stiffness-with respect to 
a specific form of deformation is characterised by k, while s is 
a measure of the deformation (for example, twist angle, displacement 
angle, etc.). 

The quantity Ue; is energy in precisely the sense referred to at 
the end of Sec. 10. Irrespective of the manner in which a body is 
deformed and the rapidity of the process, the same amount of 
expended work will always correspond to one and the same incre- 


r ks? my ks? . 
mental value of the quantity 2E Thus, = is a measure of energy 
or, to be more precise, of elastic potential energy. 
Examples. 1. The potential energy of a piece of steel wire (Young’s modu- 


lus Æ = 21,000 kg/mm?) having a length of 50 metres and a cross-section of 
10 mm2, and which is stretched 1 cm, is 


ks? r $ 
Ue =; ~ 20X106 ergs = 2 joules œ 0.2 kg-m. 


* It should be recalled that the law of elastic deformation (Hooke’s law) 


F s fe bas 
is written in the form = ET , where Æ is the modulus of elasticity and S 


S 
is the cross-section of the stretched body. Thus, -the stiffness (the constant 
of proportionality in the expression for the elastic force) has the valuek = = 3 


** The same result is obtained when we integrate the infinitely small amount 
of work dA = — ks ds between the limits sı and sz. 
4* 
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2. For rubber, Young's modulus Æ = 8 kg/cm?. A stone having a mass 
of 20 gm is shot from a slingshot to a height of 20 metres. This requires that 
an energy of 0.4 kg-m be imparted to the stone. Assume that the elastic band 
has an initial length of 40 cm and stretches an additional 40 cm. Let us deter- 
mine the required cross-section for the elastic band. 


ES s? ~ aw _ 2x40 (em) x40 (kg-cm) _ 
Ue Dat j 


Ua = 


0.25 cm?. 
Es? k Pa 
8 (3 ) 1,600 (cm3) 


Gravitational force possesses the same feature (discussed above) 
as elastic force. Thus, work expended in lifting a body in a gravi- 
tational field serves to change the body’s function of state. In this 
case, the function interesting us depends on the position of the 
given body with respect to the bodies attracting it. This function 
is called gravitational potential energy. 

We shall show this energy to exist, first, for a body located close 
to the Earth’s surface. From point 1, the body is moved to the 
higher point 2 along some curvilinear path. Let us divide this 
path into small segments, replacing the curved line by a broken line. 
The latter can be made to approximate the former to any desired 


accuracy. The work expended in moying a body along one of these 
linear segments of length dl is then 


dA=mg dlsina or dA=mg dh, 


where dh is the increase in height. Since mg does not change along 
the entire path of motion, we can place it before the brackets (before 
the integral sign when integrating) in writing the expression for 
the work expended along the entire path: 


A=mg (ħy,— hy), 


where hy and k, are the heights of points 1 and 2 respectively. 
Furthermore, 


A= (mgh)s —(mgh), = A (mgh), 


i.e., the work of displacement is equal to the increase in the prod- 
uct mgh, which is a measure of the gravitational potential energy 
for this simple case. ; 

It is quite evident that 

U=mgh 

is energy and is in complete accord with the meaning we have 
assigned to this term. Irrespective of the manner in which the work 
is performed, i.e., the path taken by a body and the speed of the 
motion, the work of displacing the body from point 1 to point 
2 will-always be the same, since the increase of energy depends 
ar on the location of these points—in our simple case, on their 

eights. = 


x ; 
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Since the work of displacing a body in a gravitational field does 
not depend on the path, the work of displacement along a closed 
curve will be equal to zero. 

It should be noted that it is immaterial what level we choose 
as our base for hk. If it is agreed to calculate from the Earth’s 
surface, the potential energy of a body at the bottom of a well will 
be negative. 

The above formula is not valid for bodies that are far from the 
Barth, e.g., the Moon. Thus, as was explained in Sec. 2, mg, the 
approximate expression for the gravitational force, should be 


replaced for large distances by the exact expression %y 


Let us calculate the work done by the gravitational force» Work 
performed by the forces of a system will be considered positive, 
while work against the forces of the system will be considered 
negative. Let us assume that two attracting bodies draw together 
along the line of action of the forces over an infinitely small segment 
—dr of the path (minus, since 7 decreases). Thus, 


mmo, 
aS. 


da= —y ot dr. 
Bub =a (+) = Therefore, 
es r 
dA=—d (re) 


Work takes place at the expense of a decrease in the value of 


U = ye, which is a measure of the gravitational energy in 
the general case: 

dA= — dU. 
The quantity 

ERAR mmo 


r 


represents gravitational potential energy in the general case. 

U is equal to zero if the bodies are infinitely far apart. When 
the bodies draw together, U increases in absolute value. But since U 
is negative, we see that, just as with the approximate formula for 
bodies close to the Earth, the potential energy is less the closer the 
attracting bodies are to each other. Naturally, if we desired, we 
could change the base line for U and make this quantity positive 
in the interval of values concerning us. . 

It is not difficult to show the relationship between the general 
formula for U and its particular case when U=mgh. Thus, replac- 
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ing r by R+h, where R is the radius of the Earth, we obtain 


i yMm a yin 
co TEN 
i R 


(M is the mass of the Earth). But since 4 is a small quantity, 


s È 5 h 
we can write with sufficient accuracy r= 4 — =, whence 


i= ym mgh. 


Changing the base line for U so that zero potential energy for 
a body is at the Earth’s surface, the formula reduces to U = mgh. 


Ezample. To obtain a clearer picture of the meaning of the above results, 
let us calculate the potential energy of a body of mass m = 1 kg at the Earth’s 
surface and at a distance of 1,000 km above its surface. 

The potential energy at the surface of the Earth is 


ny Mm 4 5.8 1027 103 a SEA TIM: carrey 
Se ere pei 6.3xtos = —6-1x10!4 ergs. 


The potential energy at a height of 1,000 km is 


1 5.8x102 x103 Le ae 
DRIP TEKA A AALOE 


Us ,000= 


_, From the calculations, it is evident that 1) the potential energy of a body 
in the Earth's gravitational field is always negative and increases with its 
distance from the Earth. (since we have agreed that it tends to zero when 
h — œ); 2) the change in the potential energy of a body rising above the 
Earth’s surface is, generally speaking, not described by the expression 
mg (he — hy). Thus, 


Y1,00—Uo= —5.3x 4014 — (— 6.131014) —0,8 4014 ergs= 816,000 kg-m, 
while the calculation with the expression mg (he — hy) yields 1,000,000 kg-m. 
Reel, when we are concerned with ascensions to a height k < R (R is the 
ae ibs $ E Earth), it is permissible to use the simplified expression 

2— fy). 


for gravitational potential energy. 


; rges q; and q having the same 
sign and separated by a distance r n 1 A 


: bed | - According to Coulomb's law, 
the Particles will repel each other, Therefore, in reducing their 
Separation to the small distance dr, 


we perform work equal to 
=d4 = — 42 dr (the minus sign is us 


ed in the left-hand member 
because the work is performed against the forces of the system; 


7 


Rici’ 
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the right-hand member also has a minus sign because the distance 
is decreasing and dr is negative). The calculation, which in no 
way differs from that for gravitational force, yields for the energy 
of electrical interaction of charges (for brevity called Coulomb 


A 
r 


energy) the expression U = , i.e., here too dA = — dU. 


The interaction energy of charges having opposite signs is nega- 
tive and behaves like gravitational energy. The interaction energy 
of charges having the same sign is equal to zero when the charges 
are separated by an infinite distance; it increases as the charges 
are brought together. 

We shall restrict ourselves to these examples of potential energy, 
although in various cases other functions of state of a body may 
be introduced. 

Potential energy always exists when forces act between bodies 
or particles of the system under consideration that depend on the 
distance between the bodies. Potential energy is the interaction 
energy of the bodies. If a system consists of a number of bodies 
or particles, we can then speak of its total potential energy, i.e., 
the interaction energy between all the particles (each with all the 
rest). Thus, in the case of four particles, the potential energy is 
composed of six terms, for we must consider the interaction be- 
tween the first body and the second, third and fourth; the second 
body and the third and fourth; and finally the third body and 
the fourth. 

In mechanics, only the potential energy of forces acting between 
different bodies is considered. If a body is complex and consists 
of many particles, the interaction potential energy of these parti- 
cles is considered to remain unchanged during the mechanical proc- 
esses. The interaction potential energy of the particles comprising 
the body is a component part of the body’s internal energy (Chap- 
ter IX). If changes in a body’s internal energy take place, the 
phenomena must be considered in the light of the laws of thermo- 


dynamics (Chapter IX). 
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Irrespective of the type of forces involved in the motion, the 
work of the resultant force is always equal to the increment of 


the body’s kinetic energy, i.e., 
mv? 
Fas=A (25) 


The forces acting on the body could be elastic forces, gravitational, 
electrical and frictional forces, etc. 
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It is always possible to separate from the effective forces those 
whose work serves to change the potential energy. For brevity, 
such forces are sometimes called potential forces or forces possessing 
potential. The work equation may be written in the form 


FyoAs+ fAs = A (25) Y 


Here, f represents the nonpotential forces. The work of these forces 
is equal to the change in the internal energy of a body or the medium 
in which the body moves. 

Substituting in place of the work of the potential forces the 
increment of potential energy with reversed sign, we can write 
the equation in the form 


fAs=4 (4. + U). 


The sum of a body's potential and kinetic energy is called total 
mechanical energy. Designating this quantity by E, we obtain: 
fAs = AG, i.e., the change in a body's total energy is equal to 
the work of the nonpotential forces, e.g., the frictional forces. 

If the work performed in changing the body’s internal energy 

is small with respect to 6, the equation simply reduces to the 
following: A€é = 0 and € = const. This is the law of conservation 
of mechanical energy, which states that the total mechanical energy 
of a body is conserved. 
— This law may be easily generalised for a system consisting of 
many bodies or particles. For each body we may write a work 
equation and then combine these equations into one. The total 
energy will then be equal to the sum of the kinetic energies of the 
bodies and the potential energy of interaction: 


g= = a mt eda EU. 


If all the interacting bodies are taken into account (such a system 
of bodies is called a closed system), the form of the law remains 
the same as for a single body. The change in mechanical energy 
is equal to the work of the nonpotential forces and, if this work 
is negligible, the total mechanical energy of a closed system of 
bodies remains unchanged, i.e., is conserved. 

The law of conservation of mechanical energy is, on the one 
hand, a consequence of the equations of mechanics (Newton’s law); 
on the other hand, it may be considered as a special case of a more 
en) law of nature—the law of conservation of energy (Chap- 

Even in mechanics alone, many forms of interconvertible energy 
are met. In considering the motion of a body under the action 
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of elastic forces or gravitational force, it i i 
increase in the energy of one of the A cE eee ak a 
panied by a decrease in the energy of the other form i? 
Thus, the gravitational force acting on a falling body decrease. 
2 { s 
the potential energy of the body and increases its kinetic energy 
The reverse is true when a body is lifted to a certain height The 
elastic force making a ball thrown against a wall rebound decreases 
the potential energy of the compressed ball and transforms it into 
kinetic energy. The reverse takes place when the wall stops the 
thrown ball (the interval from no deformation to maximum com- 
pression). . 

A stretched spring can raise a load to a certain height. On the 
other hand, a falling load can stretch a spring. Thus, elastic energy 
can be transformed into gravitational energy and vice versa. 

The above examples apply to the transformation of one form of 
energy into another in one and the same body as well as to the 
transfer of energy from one body to another. 

It is possible, of course, to transfer energy in the same form from 
one body to another: one load pulls another by means of a pulley, 
a sphere colliding with another transfers part of its kinetic ener- 


gy to it, etc. 
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action of bodies or particles depends 
i.e., it is always a function of the 
describing the location of these 
the potential energy. may 


The potential energy of inter: 
on their relative distribution, 
coordinates or other parameters 
bodies in space. In the simplest cases, 


depend on a single coordinate. ' : 
Let us consider the interaction of two particles whose potential 


energy of interaction is described by the function U(x), where x is 
the distance between the particles. For the sake of definiteness, 
let us assume the particles repel each other with a force F. Under 
the action of the interaction forces, the distance between them 
increases by da, i.e., an amount of work equal to Fda is performed. 
This is possible at the expense of the potential energy of interaction 
U, which changes by —dU (decrease of energy). 
Thus, —dU=Fdz 
or 


es the force is equal to minus the 
y with respect to v. The nature 
y simple and is clearly described 


i.e., in the case of potential fore 
derivative of the potential energ 
of the mechanical problem is then ver 
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by so-called potential curves, i.e., graphs on which the values 
of the potential energy are plotted as a function of the parame- 
ter (Fig. 47). 

In explaining the essence of this graphical method, the analogy 
is usually drawn with the motion of a body on a hill. The meaning 
of the potential curve then becomes particularly clear, for the 
profile of a hill and a potential energy distribution curve that is 
proportional to the height hk of the hill coincide if drawn to prop- 
er scale. 

Potential curves consist of crests and troughs, steep and gradual 
rising slopes as well as steep and gradual declining slopes. The 
form of the curve permits us to immediately indicate on which 
portians of the path a large amount of work is performed, on which 
a small amount, and whether the work is positive or negative in 
each case. The steeper the potential curve, the larger the force 
acting on the body. In accordance with the familiar geometric sense 
of the derivative, force is described by the tangent of the angle of 
inclination of the tangential line to thé potential curve. 

The validity of the formula relating potential energy and force 
is completely evident for those particular cases of potential energy 
that we have considered. For the potential energ 


ota body on the 
Earth's surface: a - 
d y 


ie ; e e c E 
U = mgh and F= an ~ ME- 


For a body in a gravitational field, in the general case: 


yy MMe dU. mmo 
U vE d F= A a 
For a body subjected to elastic action: 
k2 
y= = wd Fa See. — kr. 
2 dx 
For electrical interaction: 
7 
U i ps aU 192, 
fo dr r2 


Returning to the potential curve plotted in the diagram and 


keeping the above explanation in mind, we can immediately indi- 
cate where the force is greatest and the points where the force acting 
on the body is equal to zero. The latter points, i.e., the positions 
of equilibrium, are at the bottom of the potential well and at the 
potential peak. The positions where the potential energy is a maxi- 
mum correspond to unstable equilibrium, while the bottom of 
the potential well is a position of stable equilibrium. 
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We stated above that the form of the potential curve permits 
us to describe the possible motion of the body. This is not com- 
pletely accurate. In addition to the potential curve, we must also 
know the value of the total mechanical energy of the body. If this 
value is known, we can then indeed deduce from the form of the 
potential curve the possible motion of the body or particle. 

Horizontal lines are drawn in Fig. 47 at the ordinates corre- 
sponding to é, and é». If € is the total energy of the particle, 
then we can determine the 
kinetic energy as well as the U(z) 
potential energy from the 
curve. The former is the 
difference between € and U. 

The moving particle cannot 
occupy positions in which the 
potential energy is greater 
than the total energy. Thus, 
the horizontal line € restricts 
the possible motion of the 
body to certain portions of 
the curve. In the case when 
the energy is represented by z ; 
the lower line éi, the moving fanaa ee, 
point has two possible inter- 
vals in which it may be locat- 
ed. It may be either in the 
potential well (and have an oscillatory motion there) or on the 
slope to the right of point A, where it will move downwards or 
upwards depending on whether it acquires or loses kinetic energy. 

The above analysis is completely valid for any kind of potential 
curve. Fig. 18 shows several types of such curves. Thus, in Fig. 18a, 
we see the potential curve for a body oscillating on a spring. The 


Fig. 17 
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Vix) = u(x) 
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oscillating body is in a potential well having symmetrical edges. 
In Fig. 18b, a potential curve is shown that is typical for many 
interacting particles, e.g., atoms and molecules. The curve consti- 
tutes a potential well, one of whose edges has a very steep slope 
-while the other has a gradual one. Plotted along the abscissa is 
the distance between the particles. As can be seen from the curve, 
the potential energy is very large at small distances, falls with 
increasing distance, reaches a minimum, then gradually rises tend- 
ing toward some finite limit. The nature of the motion and the 
bond between two interacting particles are completely described by 
this curve. Two cases should be distinguished. The first is when the 
total mechanical energy of this pair of particles is represented by the 
lower horizontal line €,;. The second is when the total energy is 
equal to €). In the first case, the system cannot get out of the 
potential well. This means that the distance between the particles 
lies between the limits indicated in the figure. The mutual motion of 
the particles can only be of an oscillatory character. Such is the 
situation in a stable diatomic molecule. The second case is the reverse 
of the first. The total energy of interaction of the particles is too 
large for them to be constantly linked. The system may get out of 
the potential well, i.e., the bond between the particles cannot exist 
and the particles may fly apart to any distance whatsoever. 

The third potential curve in the figure is a so-called square well. 
Recalling that force is described by the tangent of the angle of 
inclination of the tangential line to the potential curve, we see 
that the potential energy may be represented in the form of a square 
well if the body or particle moves freely without the action of 
forces, yet cannot leave the bounds of the given portion of the 


curve as long as the total energy is less than the height of the sides 
of the well. 


CHAPTER III 
MOMENTUM 


14. Conservation of Momentum 


The product of the mass of a body, or particle, and its velocity 
is known as the body’s momentum (quantity of motion): p= mv. | 
The momentum p is thus a vector quantity. In a system of bodies 
or particles, the momentum is equal to the vector sum of the par- 
ticles constituting the system: 

P=pit Pt... 

What makes this vector quantity of particular interest to the 
physicist is the fact that in a closed system the vector P does not 
change, irrespective of the motion within the system itself. This 
proposition is known as the law of conservation of momentum. 

The, law of conservation of momentum follows directly from 
Newton’s laws. For each of the bodies in the closed system, the 
following equation is valid: 

e d — FF 
aE (mv) =F, 


i. @., 

dp A 

ar A 
Let us consider what happens when we write such an equation 
for each of the bodies and then add the equations. The right-hand 
member of each equation represents the forces exerted on the given 
body by all the other bodies. Thus, the force exerted on the first 
body is equal to the sum of the forces exerted on it by the-second, 
third, etc. Using double indexes, we may write: Fis + Fis + Fut... 
Similarly, for the forces exerted on the second body, we may write: 
Fy + Foo-+Fo3+.--3 for the third: #'3;+ M3.+ B33+...; etc. 
It is not difficult to see that when the right-hand members of the 
equations are added the result is zero. For each term in the first line, 
there is always a term in another line that is equal and opposite 
to it (in accordance with the law of action and reaction). Thus, 
when Fiz and F, are added the result is zero; also Fi; and Is); 
etc. Therefore, in a closed system, the following equation holds: 


apy dp, , dp d 
di n ra jae 03 di (P+ P2+Pst-.-)=0 


or i 
Pı +p: + ps+... = const. 
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This is the law of conservation of momentum. The magnitude and 
direction of individual moments may change, but their vector sum 
in a closed system does not. 


Magnitudes of some momenta: The momentum of an electron with an 
z : i -o gMX CM 
energy of 5 ev is ~ 12x10 ao 


m <em k tres p ga  kgxmetres 
en- T g Agnete, ; and-that of a freight train is ~ 107 Sl ‘ 
sec sec sec 


; that of a rifle bullet is ~ 8» 105 


15. Centre of Mass 


The methods of finding the centre of gravity of a body are well 
known. If a body is fixed at its centre of gravity, it is in a state of 
neutral equilibrium. For a sys- 
tem of particles, ora solid bedy 
considered to be broken up into 
elementary elements having 
the size of particles, we can 
write an analytical expression 
for the position of the centre 
of gravity. 

RENE Using the rule for adding 
5 (m, tme + m) 9 parallel f (Fig. 19), we 
obtain the ing expres- 
sion for the position of the 
centre of gravity when the 


Sal particles are considered to be 
distributed along a straight line, say the z-axis: 


3 
< 


3 
a 


a3 


-4-------4 


Fig. 19 


x mızı +mots--mgt3+... 
y2 
m+- m+- mg}... 


Here, zi 22, x3 ... are the coordinates of the particles, and Mi, Ms, 
mg ... ave the masses. Masses are used instead of weights since the 
acceleration of the gravitational force cancels out. 

It is shown in theoretical mechanics that for any distribution of 


particles the expression for the position of the centre of gravity 
has the form: 


Ram mars mgrg+-.., 
my--ma+m3+....? 
where R is the radius vector of the c 
vectors of the particles. 
Since the acceleration of the gravitational force cancelled out in 
these formulas, we can conclude that the point found has an objective 
significance that does not depend on the gravitational conditions. 


entre and 24, 72, 73 are the radius 
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It is valid, in fact, even if the? body is located in interplanetary 
space under conditions of weightlessness. It makes sense, therefore, 
to replace the prevalent designation “centre of gravity” by a desig- 
nation that more accurately expresses the essence of the matter. 
Thus, we speak of the centre of inertia orcentre of mass of a body 
instead of its centre of gravity. 

We shall directly see the full significance of this designation. Let 
us consider the velocity of motion of the centre of mass: 


Using the formula for the determination of the centre of mass, 
we obtain: 
rs my Uy-+Mmyvo+ mgl3-+... 
my mg-+-m3+... 


In the numerator we have the total momentum, which is conserved 
in a closed system. Thus, the right-hand member of the equation 
is equal to a constant quantity. We can conclude, therefore, that 
the velocity of the centre of mass does not change in magnitude 
or direction. Or, in other words, the centre of mass of a closed system 
of particles executes inertial motion. 

As we already know, all inertial systems of coordinates have equal 
validity. Hence, we can always go over to a coordinate system bound 
to the centre of mass of the system under investigation and consider 
this interesting point as fixed. In atomic physics, we often consider 
collisions between particles. To study this phenomenon, two systems 
of coordinates are used—the laboratory system (the natural coordi- 
nate system of an observer) and the system bound to the centre 
of mass of the colliding particles. The advantage of the latter frame 
of reference is evident: the total momentum of the particles is 
equal to zero. 


16. Collisions 


The word “collision” should be understood in a somewhat broader 
sense than that used in everyday practice. For the mechanical 
problems that now concern us, any encounter between two or more 
bodies in which the interaction is of short duration will be consid- 
ered to be a collision. Thus, in addition to the phenomena that 
can be classified as collisions in the usual sense of the word—e.g., 
impact of billiard balls and collisions between atoms and atomic 
nuclei—we have such events as a man jumping on or off a street 
car and a bullet hitting a wall. The forces arising as the result of 
such short interactions are so great that the role of all constant 
forces being exerted is negligible. As a result we are justified in 
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considering the colliding bodies as a closed system and we can 
apply the law of conservation of momentum to them. 

In many collisions, the duration of the interaction is measured 

in thousandths of a second. During this 

F interval of time, the force rises to its 

maximum value and then drops to zero. 

A typical curve for the force during such 

an impact is shown in Fig. 20. For each 

instant of time during the impact, the 

relationship between the force exerted on 

either of the bodies and the momentum 

of this body is given by Newton’s sec- 


ond law: 
d 
f; qr (mv) =F. 
Fig. 20 Rewriting this equation in the form 


FAt = A(mv), we can say that the prod- 

uct of the average value of the force 
and the duration of its action is equal to the change in momen- 
tum. A more accurate description of the phenomenon is obtained 
if we integrate the above equation from the initial instant 
of impact to the termination of the interaction. It is evident 
that s 


a 
t é 


f F dt = (nv) — (mv). 
0 


The integral on the left is sometimes called the impulse of the 

force. In the diagram, this quantity is represented geometrically 
by the area under the impact curve (see Fig. 20). 
_ There is considerable variation in the nature of collisions, depend- 
ing on the elastic properties of the bodies. It is customary to consid- 
er two extreme cases—ideally elastic and absolutely nonelastic 
impacts. 

First, let us consider the latter type. A nonelastic impact means 

» an encounter between two bodies whereby, these two bodies become 
joined. Examples of nonelastic impacts are collisions between clay 
spheres, a man jumping onto a moving trolley, the collision between 
oppositely charged ions resulting in the formation of a molecule, 
and the capture of an electron by a positive ion. 

Assume that the bodies moved with velocities v} and vy before 
the encounter. Thus, the total momentum Was mv, move. After 
the encounter the bodies have a common mass equal to my--m2 
and move with some velocity V. The momentum of the system 
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after the encounter is (mı+mə) V. Since the law of conservation 
of momentum requires that 


(my + m) V = mv + MgVo, 
the velocity of the bodies after the nonelastic impact is given by 


the formula: 
y _ mtm 


The momentum after the encounter should equal the sum of the 


momenta before impact. 

If the motion of the bodies colliding head-on is along a straight 
line, then after impact the bodies will follow the direction of the 
body having the originally larger momentum. If the momenta of 
the bodies are equal in magnitude, mv = —mv2; V is thus equal 
to zero, i.e., the colliding bodies come to a standstill. 

A nonelastic impact is accompanied by a transformation of 
energy. From the example just given, it is seen that the kinetic 
energy may even become zero. It is not difficult to calculate the 
increase in the internal energy of colliding bodies in one or another 
case. All that we need do is perform the following subtraction: 

my-+ m2 y2 mwÌ , Mavs 
i ; < E a= aaor ) 

Let us now consider ideally elastic collisions, i.e., collisions in 
which the form of the bodies is completely restored. This means 
that no changes occur in the state of these bodies, their potential 
and internal energy before and after impact remain unchanged and, 
consequently, the kinetic energy is conserved. For two bodies col- 
liding in this manner, two equations can be written that are based 
on the law of conservation of momentum and the law of conservation 
of kinetic energy. Let us designate the masses of the bodies by m 
and M. We can always make the origin of the coordinate system 
coincide with the position of one of the bodies. This simplifies the 
problem without in any way making it less general. Let us assume, 
therefore, that the body having mass M is at rest before impact. The 


above laws of conservation then yield the following two equations: 


mu=mv-+MV and = mu? = z mv? ms MV?. 


Here, u and v are the velocities of sphere m before and after impact, 
and V is the velocity of sphere M after impact. ‘ 

Let us consider several examples using these equations. First, 
we shall examine the case of noncentral* collision of two spheres 


* The impact is classified as central if the motion of the spheres before 
impact occurred along a straight line passing through the centres of the 
spheres. 


5—1409 
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having equal mass (Fig. 21). The masses cancel out in both equations 
and we obtain 


w=v+V and =v? +4 V?. 
F 


From the vector equation, it is clear that the vector w closes the 
triangle formed by vectors v and V. The equation on the right shows 
that the triangle, for which w is the 
hypotenuse, must be a right triangle. 
Hence, it follows that the velocities, 
after the collision of two particles hav- 
ing equal mass, must be directed at 
right angles to each other. This inter- 
esting conclusion is easily verified in 
billiards, where the directions of motion 
of the object ball and the cue ball form 
an angle of 90°. In other respects, the 
nature of the velocity change is not de- 
termined by our equations, for they do 
not take into consideration the deviation 
of the line of impact from the line passing through the centres 
of the spheres. 

A complete description of the motion of the spheres after impact 
is obtained if we restrict ourselves to the case of centr impact. 
The motion of the colliding spheres will then be alte he same 
straight line after impact as before impact. We can dispense, there- 
fore, with the vector notation, keeping in mind, however, that 
a change in the yelocity’s sign means that the direction of motion 
has changed. In this case, there is no need for making the simpli- 


fying assumption of equal masses. The equations for central colli- 
sion have the form 


mu =mv-+-MV and mu? = mv? 4- MV2. 


Rearranging terms, these equations can be written in the form 
m (u—v)=MV and m (u*—v?) = MV?. 


Dividing the latter by the former, we obtain: u + v = V or u = 
=—(v—V). Note that the relative velocity of motion of sphere 
m with respect to sphere M before impact (designated by u) is 
equal in magnitude to the same relative velocity after impact. 

An interesting formula is obtained when we substitute V = u + v 
in the formula for the law of conservation of momentum. We obtain 
an expression for the velocity of sphere m after impact in terms 
of the velocity of this sphere before impact: 


m—M 


Empe 


POOO nT ee 
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If the masses of the spheres are equal, the velocity v reduces to 
zero. This phenomenon can be demonstrated very effectively with 
steel or ivory spheres. For such an impact, the spheres, so to speak, 
exchange velocities (Fig. 22). In other cases, sphere m is retarded. 
The closer the values of the masses of the colliding spheres, the 
more effective the retardation. 
It is not difficult to calculate 
that when a neutron (mass 1) 
rebounds from a carbon atom 
(mass 12) it loses */,, of its ve- 
locity and when it rebounds from 
a uranium atom (mass 235) it 
loses only ?/235 of its velocity. 
For macroscopic bodies, the 
laws of elastic impact are quite Fig. 22 
valid for such materials as ivory, 
steel and rubber. These materials, after having been deformed, are 
able to reassume, to a high degree, their original form. This is 
illustrated by the interesting photograph shown in Fig. 23, where 
by means of slow-motion photography 
the moment of impact of a hockey 
ball on an obstacle is filmed. In 1/5,000 
of a second, the ball is compressed 
almost one centimetre, and it takes 
the same amount of time for the re- 
storing phase of the impact. In the 
first phase, the kinetic energy of the 
impact is transformed into potential 
energy of elastic compression. In the 
second phase, the potential energy is 
transformed back into kinetic energy. 
For an ideal impact, this reverse proc- 
ess should completely restore the 
value of kinetic energy expended dur- 
ing the first phase of the impact. 
Our formulas are not applicable for 
the important case of elastic impact 
of a sphere on a wall (Fig. 24). Since 
the kinetic energy must be conserved, 
the velocity of the sphere cannot change 
Fig. 23 in magnitude. As regards the direc- 
tion of the sphere’s motion afterimpact, 
it should form the same angle with the normal (90°—a) as before 
impact. Thus, in the case of impact on a smooth wall, the tangential 
component of the velocity remains unchanged, since no tangential 


5* 


ce all 
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adhesion forces are exerted by the wall. As can be seen from the 
figure, the increment of the momentum is numerically equal to 
2mv sin @ and is directed along the normal to the wall. According to 

the. fundamental law of mechanics, at the 
Yj instant of impact, the force exerted on the 
AA sphere by the wall has the same direction as 
that of the vector of momentum change. 
The angle of incidence of the sphere is, there- 
fore, equal to its angle of reflection. 


Let us consider an inelastic impact, using as our 
example a ballistic pendulum (a device for measuring 
the velocity of a bullet). A bob containing sand, mass 
M, hangs from a line. The bullet is fired into the bob 
and becomes imbedded in the sand. The momentum 
of the bullet before impact is mu, and the momen- 
vi tum of the system after impact is (M +- m) v. Hence, 


V'sina=iV sing 


7 m 
=— U; 
m-4-M 


Mv2 
2 


Fig. 24 
Acquiring the kinetic energy ı the bob expends 
it in rising to the height h, determined by the following condition: 


EMU u? m \2 
Mgh= zi i e, Lom (ir) (n M). 


If M = 10 kg, m= 10 gm and u = 900 metres/sec, then h = 4 em, A 
If we had not used the law of conservation of momentum in determining h, 
but had assumed instead that the total kinetic energy of the bullet had been 


transformed into potential energy of the pendulum, we y 


5 zould have obtained 
the value k = 40 metres (!). This means that, in our oxattl a, 399.6 kg-m of 


mechanical energy, or 99.9% of the total suppl , has “disa edy, i.e. 6 
to heat the system. Since absolutely elastics jodi disap A mechanica 
energy is not conserved for “elastic” impacts 
into energy of thermal molecular motion 
to this example in Chapter XI (p. 481). 


Using an example involving collision we shall now illustrate 
the merit of a coordinate system bound to the centre of mass. 

Assume that a sphere of mass m, at rest in a laboratory coordinate 
system, is hit by a similar sphere with velocity v. If the impact is 


f f 3 mp2 5 ` ; 
inelastic, some portion of -z > the kinetic energy of the system, is 


transformed into heat. In other coordinate systems, the kinetic 
energy of this pair of spheres is expressed by other quantities. As 
regards the heat released, it will be the same for the given pair of 
spheres and is simply determined by the velocity of their relative 
motion. Therefore, instead of resorting to the law of conservation 
of momentum to try to determine the portion of the kinetic energy 
that is transformed into heat, calculated for the laboratory coordinate 
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system, it is sufficient to calculate the kinetic energy for a coordi- 
nate system bound to the centre of mass. Since in such a coordinate 
system the total momentum of bodies is equal to zero, after an 
inelastic collision the spheres come to a standstill: all the kinetic 
energy is transformed into heat. The kinetic energy will have 
a minimal value for a system bound to the centre of mass. 

In a coordinate system bound to the centre of mass, the spheres 


x ys 1 ž r 
move toward each other with velocities —v. The kinetic energy of 
s ; { Wen y 

each sphere is thus equal to = mv“ and the total energy of the system 
ee. 4 ae 4 m z 
is = mv*. This is the amount of heat released during an inelastic 
collision. Irrespective of the type of impact, the heat (or other form 
of energy) released due to the kinetic energy of the bodies cannot 
exceed the amount of kinetic energy calculated for a system bound 
to the centre of mass. And conversely, in order to release a given 
amount of heat, it is necessary to have the equivalent amount of 
kinetic energy calculated for a centre of mass system. 

Example. A nuclear reaction in which a-particles bombard nitrogen: N*4 
takes place in accordance with the following equation: 

NU Het —> 07 -4H1. 

The energy absorbed in this process amounts to 1.13 Mev. How much kinetic 
energy in a laboratory system must an a-particle possess in order for the reac- 
tion to proceed? At first glance, it seems that 1.13 Mev is sufficient for this pur- 


pose. But we already know that this is not the case. In a centre of mass coor- 
dinate system, 1.13 Mev is required, but in a laboratory coordinate system, 


more energy is needed. 


: . mv mov 

Thus, the velocity of the centre of mass is ve e Eat 
__ my 2 

is the momentum of the first particle and mvg is the momentum of the 


second. The velocity of the first particle in a centre of mass coordinate 


, where myvi 


ma (vı — v). For the second particle, we may 
my- Mme 


mi (vp—v,). Hence, the kinetic energy of the sys- 
mı- mg 


system is vi =%y—Ve 


write: vg =V2— V= 
cs F ‘it 3 
tem (æ, N14) in a centre of mass coordinate system is Kem=5 H (v — v)?, 


where p= N is the so-called reduced mass of both particles. We shall 
my mg 

consider the nuclei of NH fixed (vx —=0). This assumption is justified, since 

we can always neglect the slow thermal motion of the target nuclei as 

compared with the large velocity of the bombarding particles. The kinetic 


7 s 1 
energy in the laboratory coordinate system is then Kiw=5 m,v? and there- 


fore 
my-+ mo 


Cap =K. 
Kiab om ine 
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The reaction proceeds if Kemn—=1-13 Mev. Since my=4 and mg=14, we 
obtain 


Kray 1.13% 45 = 1.45 Mev. 


17. Recoil 


The law of conservation of momentum helps one to easily under- 
stand the fundamentals of recoil in gunfire, reaction propulsion, 
and other similar phenomena. 

We shall consider, in the first place, recoil taking place in a frame 
of reference where the bodies are at rest at the initial moment. 
In the case of gunfire, this assumption is in complete accord with 
the prevailing conditions. If at the initial moment a system con- 
sisting of two or more bodies is at rest, the total momentum of 
the system is equal to zero. Irrespective of the future course of 
events, the total momentum continues to be equal to zero. Thus, 
if at some instant an explosion takes place, causing the system to 
be divided into parts having masses mi, mg, m3, ..., Which fly 
asunder with velocities vi, və, U3, ..., the total momentum mv +- 
+ ilgv2-+-mMv3-+ ... of the scattered bodies must be, as before, 
equal to zero. 

In the case of gunfire (where the system divides into two parts), 
the condition that the momentum of this system of two bodies be 
equal to zero has the form mv +MV = 0. Here, the lower-case let- 
ters refer to one body, say the missile, -and the capital letters to 
the other—the gun. The division of the system into two parts can 
only take place along a straight line. We can, therefore, dispense 
with the vector notation and write the condition in the form mv = 
=—MV. The velocities of the gun and the missile are inversely 
proportional to their masses. Thus, the greater the mass of the 
Se with respect to the mass of the gun, the greater the observed 
recoil. 

The phenomenon of “continuous recoil’’, occurring in reaction pro- 
pulsion, is of exceptional interest. It is the subject of a distinctive 
branch of mechanics that may be called the mechanics of variable 
mass. This phenomenon does not only occur in jet planes. Indeed, 
we can point to a number of commonplace occurrences involving 
such motion. As examples, it is sufficient to mention the case of 
an uncoiling roll of paper or the fall of droplets continuously con- 
densing in the atmosphere (see the example at the end of this sec- 
tion). The fundamentals of the mechanics of variable mass were 
developed at the end of the nineteenth century by Prof. I. V. Me- 
shchersky. Since we cannot describe his work here, we shall restrict 
ourselves to the consideration of a single problem in this field—a 
problem related to the possible velocity of motion of a rocket. 
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A rocket moves with a velocity v and at some instant ejects 
a certain amount of combustible gas having mass dM. The mass of 
the rocket, naturally, decreases by this amount. If the velocity of 
the ejected gas is designated by u (this velocity is not given with 
respect to the rocket, but with respect to the inertial coordinate 
system in which the velocity of the rocket motion is described), 
the momentum of the matter escaping from the rocket will be equal 
to udM. The rocket decreases its mass and increases its velocity by 
the amount dv. The momentum of the rocket after ejecting the fuel 
is equal to (W —dM) (v-+-dv). In accordance with the law of con- 
servation of momentum, we can equate the momentum Mv of the 
rocket before discarding a portion of the fuel and the momentum 
of the system after that quantity of gas has been ejected. The latter 
is equal to the difference between the momentum of the rocket 
and the momentum of the portion of fuel. Thus, 


Mv =(M—dM) (v-+dv)—udM. 


Whence, excluding second-order infinitesimals, 
eee ae 
M 

But w+v is the relative velocity of the outflowing combustible 
gas (with respect to the rocket). Designating this velocity by c, we 
arrive at the following equation for the increment in-the rocket’s 
velocity: dv= me . The minus sign is used to show that the 
velocity increases when the mass decreases. It can be seen that the 
increase in velocity is equal to the fraction of the lost mass mul- 
tiplied by the relative velocity of the ejected fuel. 

Taking the velocity of the outflowing gas with respect to the 
rocket to be a constant value, the above equation can be easily 
integrated. If the mass of the rocket was Mo when the velocity of 
the rocket was vo, and became equal to M when the velocity of 
the rocket changed to v, integration yields 


v 
aM 
\ w= —c \ i? 
vo Mo 
in. 
Mo 
v—v=c¢ ln -ir 


The latter formula was initially obtained by K. E. Tsiolkovsky, 
the first to design a rocket and do research in the theory of inter- 
planetary travel. 

Going over to common logarithms and introducing the designa- 
tion m= M)—WM for the difference in the mass of the rocket, i.e., 
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for the mass of the ejected fuel, we obtain Tsiolkovsky’s formula 
in the form 


v=c X 2.3 x log (1+47) 


(the initial velocity vp is assumed to be equal to zero). 

In modern rockets, the velocity of gas outflow is probably not 
less than 2,000 m/sec. Using this velocity in the formula, the 
following table of values is obtained: 

al Ore5 Aa E 40.0? sata) 154 999 
v (m/sec) 446 4,386 3,218 4,817 7,013 8,000 413,815: 


As can be seen from this table, the rocket velocity increases 
much slower with respect to the amount of ejected fuel than one 
would like. To give the rocket a large velocity, a tremendous amount 
of fuel, relative to the initial mass of the rocket, must be ejected 


Thus, if a velocity of 7 km/sec is imparted, less than of the ini- 


tial rocket mass will remain. a 

A velocity of about 14 km/sec must be imparted to a rocket if 
it is to escape from the Earth’s gravitational pull. This figure is 
obtained in the following simple manner. To escape from the Earth, 
a rocket must possess sufficient kinetic energy to perform the work 
of moving a body from the Earth’s surface to infinity. But this 
work against the force of gravity is equal to the difference between 
the rocket’s potential energy at the Earth’s surface and t infinity. 
Since at infinity the potential energy is equal to zero, the condition 
for escape from the Earth has the following simple form: 


mv? _ mM 


2 Rye 
where M and R are the Earth’s mass and radius, respectively. 
Multiplying the numerator and the denominator of the right-hand 
member of the equation by R, then substituting vor by g, the 
acceleration of the gravitational force at the Earth’s surface, and 
cancelling the rocket’s mass, we obtain the condition for escape 


from the Earth: v = V 2gR, which yields a figure of about 14 km/sec. 
If we assume that the velocity of the gas outflow is 2,000 m/sec, 


the ratio 7 can be obtained from Tsiolkovsky’s formula. It is 
equal to 244. For the rocket to escape from the Earth, its design 
must be such that only ae of the rocket’s mass before take-off 


will remain in its interplanetary flight. The problem is an excep- 
tionally difficult one. Basic advances in the field of astronautics 


oo 
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will be achieved, therefore, when methods of producing rocket 
engines with considerably larger velocities of fuel ejection are 
found. If we were to succeed in increasing this velocity by a factor 


of three, i.e., increasing this velocity to 6 km/sec, the ratio= would 


fall to 5.3. 
It is easier to put an Earth satellite into orbit because a smaller 


initial velocity is required. If we assume that the acceleration of 
the gravitational force at the heights at which we desire the satel- 
lite to orbit is appoximately the same as at the Earth's surface, 
then the law of mechanics for artificial planets will have the form 
mg = ma, and since the satellite moves in a circle, the centripetal 
2 . P : : 
force is a = >: Thus, the velocity of the rotating satellite is v= 
=V gR, i.e., 8 km/sec. When such a velocity is imparted to 
a rocket, it is transformed into an Earth satellite. From the above 


table, we see that the value of ar required for imparting a velocity 
of 8 km/sec to a rocket is 54. 


Example of motion of a body with variable mass. Consider a water droplet 
falling in an atmosphere saturated with water vapour. At the instant of time ż, 
the droplet has a mass m and a radius r. During the time dt, the yolume of the 
droplet, and hence the mass (for a density equal to 1), increases by nr? dr. 


n ý 4 . dm dr i Ee 
Thus, the rate of increase in mass 1s a ute are At the same time, it is 


dı i 
clear from physical considerations that T the rate of condensation of the 


water vapour, must be proportional to 4zr*, the condensation surface. Hence, 
ar d r= kt, where k is some constant of proportionality. 


a const an 
Let us derive the equation of motion of this droplet in the Earth’s gravi- 
ted in the change-of momentum d (mv), which 


tational field. We are interes angi 
according to the fundamental law of mechanics is equal to Fdt, where F = mg. 


Thus: 
d > AICO dm 
F= sp (mv): Me GER mg=m-z t v ane 


Substituting the expressions for m and r, we obtain 
dv, 30 


By integrating this equation, we arrive at the following result: v = ft, ie, 


IRA z 2 4 
the droplet falls with the constant acceleration a Eeto cm/sec?. The resist- 


ance of the air was not taken into consideration. 


CHAPTER IV 


ROTATION OF A RIGID BODY 


18. Kinetic Energy of Rotation 


In this chapter, we shall be concerned with “perfectly rigid“ 
bodies. This means that we may neglect any deformation occur- 
ring during the motion of such a body, and assume that the dis- 
tances between the particles of the body remain unchanged. 

Let us consider a rigid body rotating about a fixed axis passing 
through it (Fig. 25). We can con- 
ceive of the body as consisting of 
small volumes of masses Amy, Ama, .. - 
at distances ri, rs,... from the axis of 


values of the distances are the various 
velocities of motion Vi, vo, ... We are 
interested in the kinetic energy of rotation 
of the entire rigid body, which is com- 
posed of the kinetic energies of the indi- 
vidual particles Amı, Ams, ..., i.e., 


KER Ari a , anii Bi. 

The velocity of angular motion of any 
point of the body can be easily expressed 
in terms of w, the angular velocity of the 
rotating-body. If the body turns through 
an angle dp in the time dt, the deriva- 


; dp . 
tive a is called the angular velocity: 


Fig. 25 


ap 
dt * 


For the case of uniform motion, the above formula is transformed 
into a relation already known to the reader, namely, o =~. The 


quantity œ is usually measured in radians per second. If the body 
performs 4 revolution per second, its angular velocity is equal to 
27 rads/sec. 

Different points of a rotating rigid body have different veloci- 
ties v (called linear velocities), but the angular velocity œ is the 


rotation. Corresponding to the various ' 
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same for each point. In turning through an angle dg, a point de- 
scribes an arc ds = rdọ. Dividing both members of this equation 
by the time of motion dt, we obtain a relationship between linear 
and angular velocity: 

v=or. 


Thus, the formula already known for uniform motion is valid in 
the general case. 

Using this relation, the expression for K,.¢ may be written in 
the following form: 


@? 72 TI 
Kroi => (r2Amy + Ame: +. « +). 


The quantity in brackets does not depend on the velocity of motion, 
but is a measure of the inertial properties of a body executing 
rotational motion. The greater the value of the expression in brack- 
ets, the greater the energy that must be expended to achieve 
a given velocity. Therefore, the quantity 

I=rAm, +r: Am +... 


is called the moment of inertia of the body, and the term r?Am is 
the moment of inertia of a particle. The quantity I may be ex- 


pressed more briefly as follows: 
I= f r?dm, 


where the integration (summation) encompasses all the particles 


of the body. i : 
The formula for a body’s kinetic energy of rotation acquires the form 


Io? 
Kroi= 7: 


This formula is valid for a body rotating about a fixed axis. For 
a rolling body (a ball, wheel, etc.), the energy of motion will consist 
of the energy of rotational and translational motion. Thus, if 
a rolling body has a mass M, moment of inertia J, translational 
velocity v and rotational velocity œ, the kinetic energy is 


Mv? To? 
Koua | on 


at this formula is valid for any arbitrary 

theoretical mechanics, it is shown that 
lways be resolved into translational and 
in this case, must be considered 
through the centre of mass. 


Moreover, it turns out th 
motion of a rigid body. In 
any arbitrary motion can a 
rotational motion. The rotation, 
with respect to an-axis passing 
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19. Moment of Inertia 


If we carefully examine the formula for moment of inertia, we 
see that the value of J depends on the nature of the distribution 
of the mass with respect to the axis of ro- 
tation. The particles that are far from the 
axis of rotation contribute considerably 
more to the total value than those that are 
close to it. 

Let us calculate the moment of inertia of 
a flat disk, of radius r, relative to the axis 
perpendicular to the plane of the disk and 
passing through its centre (Fig. 26). The mass 
of an annular element of radius 2 is dm = 
= p  2nadz, where p is the density of 
Fig. 26 the disk’s material. This ring has a mo- 
ment of inertia dI, = dma?, and the mo- 

ment of inertia of the entire disk is: 


r 


T 
4 2 
I= | dT, =| px 2n x a%de = 2np 5 = E 
ô 


2 
0 


It is evident that with respect to the same axis the moment of 
inertia of a ring is I, = mr, i.e., Ip = 2I;, when the entire mass 
is concentrated at the outer circumference. 

The moment of inertia of a body will vary in accordance with 
the location of the axis of rotation. If a thin rod rotates about its 
long axis, the moment of inertia will be very small, for all the parti- 
cles lie very close to the axis of rotation and,, therefore, all the 
quantities rj, 73, ... entering in the formula for 7 will have very 
small values. The moment of inertia will be much larger if the rod 
is rotated about a line perpendicular to its axis. 

The moment of inertia depends on the ori- 
entation of the axis and the location of the 
point through which it passes. If no specific 
stipulation to the contrary is made, it is assumed 
that the axis of rotation passes through 
the body’s centre of mags. d 

If the axis of rotation is displaced relative to 
the centre of mass by the amount a (Fig. 27), 
I, the new moment of inertia, will differ from 
Io, the moment of inertia with respect to the 
parallel axis passing through the centre of mass. 

In view of what was stated at the end of 
the previous article, we can express the kinetic 


——— Oe N 
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energy of a body rotating about the displaced axis as the sum 


> Mv? o? 
Keo >) 


where v is the velocity of motion of the centre of mass and is equal 
to aw. Thus, 

B Ma?w? Igo? 2 

pum S Ma: 


pj 3 


Hence, the moment of inertia Z with respect to a parallel axis, 
displaced by a distance a from the centre of mass, can be expressed 


as follows: 
IT=Iy+ Ma. 


It follows that the moment of inertia with respect to an axis pass- 
ing through the centre of mass is always the smallest possible for 
a given orientation. Depending on its symmetry, a body will have 
one, two or three moments of inertia with respect to the main axes 
passing through the centre of mass. 

Thus, a disk is characterised by two axes passing through its 
centre—one lying in theplane of the disk and the other perpen- 
; : n 8 r2 s mr? 
dicular to the disk. The moments of inertia are then iia an > p 

4 
respectively (it is assumed, naturally, that the distribution of mass 
throughout the disk is uniform). For a ring, the moment of inertia 

2 z 
about similar axes is mE and mr?, respectively. 

For all solids of revolution, it is sufficient to know the moments 
of inertia with respect to two axes. In the case of a body of arbitrary 
form, to completely describe the inertial properties of the body 
during rotation, it suffices to know three moments of inertia with 
respect to axes passing through the centre of mass, namely, Imax— 
the largest moment of inertia, Imin— the smallest, and Imean—the 
moment of inertia with respect to an axis perpendicular to the 
first two. 

The only bod $ 
axes is the same is a sphere. For a sphere, Z = = mr. 

e formulas for moment of inertia are calculated from 


y for which the moment of inertia about all the 


The abov 
the relation: 


f= f r?dm. 


To use this formula, it is generally necessary to be able to operate 
with multiple integrals. Examples of such calculations are given 


in courses on theoretical mechanics. 
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As we shall see below, physicists are sometimes interested in 
the values of moments of inertia for molecules. Since the mass of 
atoms is concentrated in nuclei whose dimensions are very small, 
the calculation of the moments of inertia can be accomplished 
without difficulty, for the atoms may be considered as point masses. 

For a diatomic molecule, the moment of inertia with respect to 
the axis passing through the atoms is equal to zero. For the axis 
perpendicular to the line joining the atoms we obtain 


` 


I= marat merh, 


where r4 and rpg are the distances of atoms A and B of a diatomic 
molecule to the centre of mass. If l is the distance between atoms, 


r m 
Tat Tr, =l and ao a - Therefore, 


fe anon 45 
mama 


The moments of inertia of more complex molecules may also be 


calculated as the sum of the moments of inertia of the atoms con- 
sidered as point masses. 


Examples. 1. The flywheel of a ship’s engine has a mass of about 4 ton, 
a diameter of 2 metres and, therefore, a moment of inertia J ~ 1,000 kg m?. 
Making 300 rpm, the flywheel possesses a kinetic energy of rotation — 


K =Z ~ 500,000 joules ~ 50,000 kg-m. 


2. The moment of inertia of the Earth is about 1015 gm cm? = 1088 kg m?. 
The kinetic energy of rotation of the Earth about its axis is 2.5 Xx 1029 joules. 

3. In a molecule of hydrogen Hy, the distance 1 = 0.753 X 1078 cm, the 
mass of the hydrogen atom my = 1.6598 x 10-24 gm and, therefore, the moment 
of inertia of the molecule with respect to the axis perpendicular to J is 


2 
1= THE 0.46 10-40 gm cm? 


[j 


20. Rotational Work and the Fundamental Equation 
of Rotation 


If a body fixed on a shaft is 


made to rotate by a force F or, on 
the other hand, if a rotating 2 


RE as otating body is braked by the force F, the 
kinetic energy of rotation increases or decreases by the magnitude 


of the expended work. Just as in the case of translational motion, 
this work depends on the effective forces and the displacement 
produced thereby. However, the displacement is now angular, 


at 
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and the expression that we know for the displacement of a particle 
by a certain distance is not applicable here. 

To find the formula that we are interested in, let us refer to 
Fig. 28. The force F is applied at a point located at a distance r 
from the axis of rotation. The angle between the direction of the 
force and the radius vector is designated by 0. Since the body is 
perfectly rigid, the work of this force (even though applied at one 
point) is equal to the work expended in rotating the entire body. 
In rotating the body through 
an angle dg, the point of ap- 
plication traverses the path 
rdo and the work dA, equal to 
the product of the projected 
force along the direction of 
displacement and the magni- 
tude of the displacement, is 
then 


dA= Fr sin0 dọ. 


Fr sin 0 is known as the 
moment of force or torque: M = Fr sin 0. From the diagram, it is 
seen thatrsin 0 =d, where dis the shortest distance between the 
line of action of the force and the axis of rotation. Hence, 


M= Fd, 


i.e., the torque is equal to the product of the force and the lever arm. 
The formula for work that we have sought is 


dA=Mdg. 


The work of rotating a body is equal to the product of the effective 


torque and the angle of rotation. ; LOAI 
Strictly speaking, the formula is only valid for an infinitely small 


angle dp. However, we may use it in any case if we understand 
M to mean the average value of the torque for the time of rota- 


tion. Then, - 
AA = Mado. 


The work of rotation goes to increase the kinetic energy of rota- 
tion. Hence, the following equation must hold: i 
Io? 
Mdọ=4 (45) ; 


If the moment of inertia is constant for the time of motion, then 


Mdo = To do 
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or, since O= En 
SS BOE cee? 
dw 
M=I-;- 


This is the fundamental equation of motion for a rotating body. 
The torque acting on a body is equal to the product of the moment 


S s š dw 
of inertia and the angular acceleration -7y 


Examples. 1. The torque on a wheel of a locomotive developing a traction 
of about 10,000 kg is about 300 kg-m ~ 3,000 newton x metres. 

A man riding a bicycle produces a torque of about 100 newton metres 
on the pedals. 

2. By means of an example, we shall show the connection between the 
expression for the kinetic energy of a moving rigid body (see p. 75) and the 
fundamental law of mechanics. 

Let us consider a spool of mass m and radius 
r, possessing a moment of inertia / with respect 
to its axis and wound with weightless thread 
(Fig. 29). The free end of the thread is fastened 
at a certain height above the Earth’s surface. 
The spool is allowed to fall under the action of 
its own weight mg. Hence, the equations of mo- 
tion for the spool are: 


and 


where T is the tension of the thread and œ is the 
angular velocity of rotation of the spool. Elimi- 
nating 7, we obtain for the acceleration: 


dv g 
mg q+ 
Fig. 29 


mr? 


If time is counted from the moment the spool be- 
gins to fall, then in é seconds the spool will fall a 


y v2 i r 
distance h ae It is evident that the total kinetic energy of the spool at 
that instant is equal to the change in the potential energy of the spool: 


2 
K=mgh=mg > ‘ 


Substituting the expression for a, we obtain 


2 2 
mv Jo 


Ber ae 
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21. Angular Momentum 


The similarity between the formulas of motion of a particle and 
the derived laws for the rotation of a rigid body is immediately 
evident. Thus, compare the following formulas: 


Particle Rotating body 
dv do 
F=may M=I=> 
K=25 tc. 


Clearly, the physical concepts are also analogous. While in the 
mechanics of particles the acceleration is determined by the force, 
in rotational motion the angular acceleration is determined by the 
moment of force, i.e., the torque. The role of mass is played by 
the moment of inertia, which in rotation is the measure of a body’s 
inertia (the mass alone is insufficient here for this purpose). This 
similarity encourages us to go a step further and assume that anal- 
ogous physical quantities are related by analogous relations. 

In the previous chapter, it was established that the momentum 
p=m isa physical quantity satisfying the law of conservation in 
a closed system. The quantity analogous to p is the moment of 
momentum (angular momentum): 

N=Io. ` 
It can be rigorously proved that angular momentum satisfies the 
law of conservation, i.e., in a closed system, the total angular 
momentum of the bodies belonging to this system does not change. 
An increase in the angular momentum of one of the bodies is com- 
pensated for by an equivalent decrease in the others. 

The relation 

ILo, + [202+ La3-+ ++ = const 
has many interesting applications that are in many ways analogous 
to the problems studied in the previous chapter. 

The law of conservation of momentum when applied to a single 
body has the form mv = const and is, therefore, identical with the 
law of inertia. Even in this simple case, the law of conservation of 
angular momentum leads to an interesting result. A single body, in 
the absence of interaction with its medium, must satisfy the condition 

Io = const. 
However, the moment of inertia of a body may change during 
motion. It is, therefore, evident that an increase in J must be accom- 
panied by a decrease in œ, and vice versa. 

One can cite numerous examples, and this phenomenon can be 
strikingly demonstrated by means of a swivel stool. Holding a pair 
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of dumb-bells in your hands, be seated on such a stool (Fig. 30). 
Place your arms outstretched in a horizontal position and have 
someone give you a small rotatory push. Motion takes place for the 


, Fig. 30 


particular moment of inertia J at an angular velocity o. Now, fold 
your arms on your chest. As a result the moment of inertia drops 


Fig. 34 


sharply to I’. Since the product Jo remains unchanged, Jo = l'o’. 
Thus, changing the position of one’s arms leads to a considerable 
increase in the velocity of rotation. The process may be repeated— 


` 
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stretching one’s arms out leads to retarded motion and folding 
them produces accelerated motion. 

Decreasing the moment of inertia as a method of increasing the 
velocity of rotation is quite familiar to gymnasts and dancers. It 
is used in all kinds of jumps, tumbles and spins. Thus, a ballet 
dancer in a position of large moment of inertia will impart velocity 
by changing her posture to a position of small 
moment of inertia (Fig. 34). 

Rotational recoil is usually demonstrated by 
means of the aforementioned swivel stool and 
a wheel fixed on a long axle (Fig. 32). While 
standing on the stool and holding the wheel 
above one’s head, the wheel is twirled by 
means of a sudden movement. As a result, 
the stool rotates in the direction opposite to 
that of the wheel. This is precisely what is 
meant by recoil. J101, the angular momentum 
of the wheel, is balanced by Z202, the angular 
momentum of the stool with the person stand- 
ing on it (the angular momenta have oppo- 
site signs). This is due to the fact that in the 
initial state both the stool and the wheel did 
not rotate, and the total angular momentum 
was equal to zero. 

An inelastic impact was defined above as 
an encounter between two bodies as a result 
of which the bodies move together. Some- 
thing analogous may be demonstrated in the R 
case of rotation using the equipment just de- 8. 
scribed. The wheel is made to rotate and is 
then transferred to a person standing on the stool. Thus, the initial 
state is the folowing: the stool and the person standing on it are 
at rest, while the bicycle wheel is rotating with a momentum J101. 
Now, the person on the stool takes hold of the wheel. The angular 
momentum J,@; cannot disappear, but it now belongs to the entire 
system. Naturally, the person on the stool and the wheel rotate 
together in the same direction as the wheel was rotating. Clearly, 
Tyo, = (l+) o. HE before “unification”? the person rotated with 
a velocity >, the angular momentum to be conserved is J101 -4 


+-In@. Therefore, 


Ty; +1202 
Tyo, Tae = (at 2) © Of =~ eT * 
This is very similar in form and in content to the expression for 


inelastic impact. 
6* 


a 
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Examples. 1. The flywheel of a ship’s engine has a moment of inertia of 
cg m? 
1,000 kg m? and at 300 rpm its angular momentum is ~ 30,000 


2. A billiard ball whose radius is 2.5 cm has a moment of inertia J = 
= 250 gm cm? and moves with a velocity of 5 m/sec without skimming the 
table. Its angular momentum is then ~50,000 gm cm?/sec = 5 X 107? kg m?/sec. 

3. The angular momentum of the Earth in rotating about its axis is 
~10%4 kg m?/sec. 


22, Free Axes of Rotation 


Let us assume that a body has received angular momentum about 
some axis to which the body is fastened. Further, let us assume 
that the fastening is then removed. While the angular momentum 
must be conserved (naturally, neglecting friction), the orientation 
of the body in space may 
change. If this occurs, 
and as a result there is a 
change in the moment of 
inertia, it will be com- 
pensated for by a corre- 
sponding change in the 
angular velocity. 

However, in a num- 
ber of cases the nature 

k of the rotation does not 
Fig. 33 change. Stable rotation 
! takes place about the 
original axis, just as if the axis of rotation were fixed as before. 
Theory and experiments show that there are two axes passing through 
the centre of mass that may be permanent, free axes of rotation, 
namely, the axis of maximum moment of inertia and the axis of 
minimum moment of inertia. 
If the fixed axis of rotation passes through the centre of mass 
(Fig. 33), but is inclined to the axes of symmetry and, therefore, 
to the afore-mentioned orientations, then after the fastening is 
removed, the body begins to change its orientation with respect to 
the axis of rotation. It can be seen from the figure that the reason 
for the change of orientation is the fact that the centrifugal forces 
form a couple of forces. The body will còntinue changing its orien- 
tation until the axis of rotation becomes a free axis. 

It can be shown in a number of ways that a freely rotating body 
will keep changing its axis of rotation until the rotation occurs 
about a free axis. Tying bodies of various shapes to one end of 
a string, and attaching the other end to the shaft of a rapidly rotat- 
ing motor, we can transmit rotary motion to a body without having 
a fixed axis of rotation. In Fig. 34, the successive orientations of 
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a rotating hoop, chain and match box are shown. The match box 
will begin rotating about its shortest or about its longest edge. 
Theory shows that rotation about the axis having a mean moment 
of inertia will not be stable even if this axis is an axis of symmetry. 


y 


¢) 


Fig. 34 


In constructing one of the first turbines, atlempts to fix the position of the 
shaft with sufficient accuracy to eliminate the couple of centrifugal forces 
acting on the bearings at a velocity of 30,000 rev/min were unsuccessful. At 
such high velocities, these forces are intolerably great. The problem was solved 
by using a flexible shaft for the turbine wheel. The rotation took place 
about the free axis and the flexible shaft adapted itself to this axis. 

Let us consider this phenomenon in eect more detail. We shall desig- 
nate the shift in the centre of gravity of the turbine wheel due to the wheel’s 
asymmetry by a and the amount by which the shaft sags under the action of 
the centrifugal force by A. The shaft sags in the direction of the asymmetry. 
Hence, the expression for the centrifugal force may be written in the form 
4n2n2M (a+ A). This force is balanced by the elastic force kA, where k is 


the stiffness of the shaft, Thus, 


pense St. 
A=a iE aX . 
4n2n2M 


he number of revolutions per minute n is large 
ase, but tends to become equal to minus a, the 
measure of the wheel's asymmetry. This means that when the angular veloc- 
ity of the turbine increases the total displacement of the wheel with the shaft 
from the axis of rotation tends to become equal to zero. Herein lies the adapta- 
bility of the flexible shaft: It can bend, without breaking, by the amount re- 
quired to eliminate the centrifugal force. P 3 3 

From the above formula, it follows that the condition k/án?n? M = 1 is 


critical, for the relation shows that the shaft’s sag becomes infinitely large. 


This is the instant of resonance which must be rapidly passed in running the 


la ik 
turbine (the external frequency n= pg T’ 
cl of mass M placed on a shaft of stiffness k; 


The formula shows that when t 
the shaft’s sag A does not incre: 


i.e., n coincides with the 


natural frequency of a turbine whe 
see Chapter V). 
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23. The Gyroscope 


The term gyroscope usually denotes a device that can rotate about 
any orientation of its axis. If a gyroscope is rotated and then not 
interfered with, its axis of rotation will remain unchanged as long 
as no forces act on it (Tœ, in this case, should not change). 

The action of a force on a gyroscope’s axis of rotation manifests 
itself in a somewhat surprising manner. This may be demonstrated 
using a gyroscope that is balanced by a load in such a manner that 
the axis of the device is horizontal (Fig. 35). The gyroscope is 


Fig. 35 Fig. 36 


rotated in the vertical plane and a load G is placed on the 
horizontal bar. It would seem that the entire right-hand portion, 
i.e., the gyroscope, should move upwards. Indeed, this would be 
the case if the gyroscope were not rotating. Actually, the rotating 
gyroscope begins to move with constant velocity about the vertical 
axis as shown by the dotted line and the arrow. This motion is at 
a right angle to the direction of the applied force. 

This phenomenon, revolving of an axis of rotation about the 
direction of an applied force, is called precession. Everyone is famil- 
iar with the precessional motion of a top. As soon as the axis of 
the top begins to deviate in the least from the vertical, a gravitation- 
al torque begins acting on the top, tending to topple it. A stationary 
top would fall, but a rotating top begins to precess about the verti- 
cal. The axis of the top will then describe a cone whose vertex is at 
the point of support of the top. 

In general, the rotation of a top is even more complex, for nuta- 
tions are superimposed on the precessional motion. These nutations 
are due to small jolts (which are always present) that make the 
top shake (Fig. 36). As a result of the nutation effect, the axis 
describes a cycloidal curve, as shown in the figure, instead of a cir- 
cle. It should be noted, however, that nutational effects are usually 
very weak. 


CHAPTER V 
VIBRATIONS 


24. Small Deviations from Equilibrium 


The motion of a body or particle about an equilibrium position 
is often encountered in nature. Thus, a small load on a string oscil- 
lates back and forth, a spring quivers, and an atom in a crystal 
lattice vibrates. 

If the body or particle on which forces are acting is in a position 
of equilibrium, its potential energy is a minimum and the system is 
in a potential well (Fig. 37). When the deviation from the equilibri- 
um position is not large, we are . 
concerned with only a small portion u 
of the potential well. A potential 
curve near the equilibrium position 
can always be approximated by a 
parabola, i.e., it can be written in 
the form U=+ ka*. Here, + kis 
the constant of proportionality. The 


factor + has been introduced for _ 

convenience and} its purpose will S aa 

presently become clear. f f 
The reasoning used in arriving at 4 

the above relation is the follow- Fig. 37 

ing: Potential energy isa function 

of the displacement from the equilibrium position. As is well known, 

making the proper assumptions, any function may be expanded in 

a Taylor series for small values of z. The exponent of x increases 

consecutively from term to term: 


SS 


U=ax+5 keba? ert- g 


However, for small x, the terms of higher power may be neglected 
and, if the potential well is symmetrical, the first term vanishes, 
for the potential energies at equal distances to the left and right 
of equilibrium are equal. / 

The force acting at a point deviating from the equilibrium posi- 


tion is equal to minus the derivative of the potential energy. Thus, 
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if the energy is expressed by the formula u=+ kz?, then F =—kz. 


The meaning of the negative sign is clear, namely, the force in 
question always restores the body to the equilibrium position and 
is always directed oppositely to the displacement. Consequently, 
the force F = —kz is called the restoring force and the coefficient X 
is sometimes called the restoring force constant. 

What is the nature of the motion under the action of the restoring 
force? Newton’s law, which is written in the form ma = —kzx for 
motion near equilibrium, should give us the answer to this question. 

This equation is satisfied if the point undergoes harmonic vibra- 
tion about the equilibrium position, i.e., vibration in accordance 
with the relation 


x= Acos = t, 


where T is the period of vibration. 

Let us verify this statement. The velocity of motion of the point 
for the indicated dependence between displacement and time is 
dz 21A si 2n 
di q aay 


It should be noted that the maximum value for the velocity of 
the vibrational motion, i.e., the amplitude of the velocity, is 


Ue = aa Let us now determine the acceleration by taking the 
derivative of the velocity. We obtain 


e án Qn n 
a= —-pr 4cos-7 t- 
Substituting the expressions for acceleration and displacement in 
Newton’s law, ma = —kz, we obtain 
42 27 ‘oe on, 
ma Acos 7t kA cos yt 


We see that the factors depending on time cancel out. Hence, the 
equation for harmonic vibrations satisfies Newton’s law for small 
deviations from equilibrium. 

It is noteworthy that Newton’s law places a constraint on the period 
of the vibrations. As can be seen from the last formula, the period of 


the free vibrations about the equilibrium position is T= ™ - 
This period is determined by the vibrating system—the restoring 
force constant Æ and the mass of the particle. It is, therefore, under- 
standable that this period is called the natural or characteristic 
period of the vibrating system. ; 
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No restrictions are placed on the amplitude A of the vibrations, 
with the exception, of course, that the deviations from the equi- 
librium position must be small. 


25. Particular Cases of Vibrations 


In view of the fact that we deal with two types of potential ener- 
gy in mechanics, namely, elastic and gravitational, it also becomes 
possible to divide mechanical vibrations into these two cases. 

Bodies vibrating under the action of an elastic force usually 
perform linear vibrations of compression and extension. However, 
torsional vibrations are also encountered. 

If a body suspended from an elastic band, spring or wire is dis- 
placed from the equilibrium position along the band, spring or 
wire axis, linear vibrations arise under the action of the elastic 
restoring force. The coefficient Æ is, in this case, the stiffness of 
the vibrating body. 


To what extent this coefficient determines the resulting period and fre- 
quency of vibration is seen from the following example. Identical loads, whose 
masses are equal to 1 kg, are suspended from three springs having different 
stiffnesses. Under the action of these loads, the springs are elongated by 1 mm, 
1 cm and 1 metre, respectively. The coefficients of stiffness will then have the 
following values: 


981x103 w dynes, , _ o dynes , 
u= 0A =0.981 x 10? — 3 Kg =0.981X108 —— 3 
ka = 0.981 108 S985 


The periods and frequencies of the vibrations are: 


m 103 4 -2 8 * 
T,=2n ap on Vissi ator =8-34x10 sec, Vy = 15.8 cps; 


Ta=0.2 sec, Vo=5 CPs; 
T,=2sec, Vg=0.5 cps. 


For torsional vibrations, the restoration to equilibrium takes 
place under the action of a torsional moment that is directly pro- 
portional to the angular displacement for small deviations from 
equilibrium. If, for example, a massive disk having a moment of 
inertia J is suspended from a wire, and the wire is twisted by some 
angle or other, the equation for the torsional vibrations of the 


disk will be J ae — Do. The torque D, relative to unit angular 


displacement, corresponds to the restoring force constant, and the 
moment of inertia corresponds to the mass. Thus, the period of free 


"MOE 
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‘torsional vibrations is represented by the formula 


TEn +. 
The greater the moment of 


inertia, the lower the frequency of the 
vibrations. 


Ezapmle. Assume that a disk Raving a mass of 100 gm and a radius of 
5 cm is suspended from a steel wire and that the period of the torsional 


A A 3 mr? 
vibrations is 4 second. The moment of inertia of the disk is Iy= 


2 


à ; OG eae 
=1,250 gm-cm*. Thus, the restoring force constant D— 24. 49,400 


TTE 
deron, If a disk of the same mass but of 1 cm radius is suspended 
T 


from the same wire, the period of the torsional vibrations will no longer 


be 4 sec, for Ty=2n Bx 0,2 sec. 


A body oscillating under the action of gravitational force con- 

stitutes a pendulum. If the pendulum may be approximately repre- 

sented as a point mass suspended from a weight- 

less wire, we call it a mathematical pendu- 
lum (Fig. 38). 

From the figure, it is easily seen that the 
expression for the restoring force ae a, 
i.e., the component of the weight along the. 
tangent to the path. If the deviation from 
equilibrium is small, the sine of the angle may 
be replaced by the value of the angle a or by 
the quotient obtained when the displacement 
x is divided by the wire length Z. In this ap- 
proximation, displacement along the chord is 
assumed to coincide with displacement along 
the arc. Thus, the restoring force is equal to 


mg > and the restoring force constant is equal 


to a In the expression for the period, the 


non 
mass of the bob cancels out and T = 271 i 


m E 
4 The fact that the period of a pendulum does 
Fig. 38 not depend on the mass is an example of a com- 
mon feature of particle motion in a gravitation- 
al feld. Since according to the law of ‘gravitation the force 
acting on such a particle is proportional to the mass, the mass can- 
cels out in the equation of motion. Thus, we have arrived at the 
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well-known result that, for a given location in a gravitational field, 
the period of a mathematical pendulum depends only on its length. 

The measurement of the period of a pendulum may be used to’ 
determine g. The value of this measurement may be determined 
extremely accurately so that very minute variations in the value 
of g may be ascertained. Various methods of determining the Earth’s 
shape and various gravimetric investigations are based on this 
measurement. (Small changes in the value of g, which, however, 
greatly exceed the limits of experimental error, may occur due to 
seams of various density below the Earth’s 
surface.) 

When the small oscillations of a physical body 
cannot be approximated by a point mass, the pen- 
dulum is called a physical pendulum. Fig. 39 shows 
a rigid body whose axis of rotation (oscillation) 
passes through it. The period of the physical 
pendulum is calculated by the same formula as 
for torsional vibrations: 


room V5, 


since the equation 


dw) 
== — Do Fig. 39; 


is valid for the motion of any body rotating about an axis. However, 
in the case of the gravitational field, we can easily express the torque 
relative to unit angular displacement by a more direct pendulum 
characteristic. From the same figure, it can be seen that the torque 
is equal to mgr sin &, i.e., the product of the weight of the body, the 
distance r from the centre of gravity to the point of suspension, and 
the sine of the angle of deviation from the equilibrium position. 
Since the deviation from the equilibrium position is assumed to be 
small—as always in this section—we obtain the expression mgr œ 


for the torque; whence, D = = mgr. Thus, the period of 


a physical pendulum is given by 


The quantity V” = = is called the equivalent length of the phys- 
ical pendulum. This is the length that a mathematical pendulum 
would have for such a period. , 


k2 
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26. Transformation of Energy. Damped Vibrations 


If there is no friction, the total energy e of a body naturally 
remains unchanged for vibrations about its equilibrium position. 
Since potential energy is usually expressed relative to an arbitrary 
level, we shall assume that the potential energy in the equilibrium 
position (displacement z = 0) is equal to 
zero. At any instant of motion, 

In the equilibrium position, the kinetic 
energy is a maximum. In the end po- 
sitions, the body comes to a standstill 
(@=0 and z= A) and the potential 
energy is a maximum. It is evident from 
this, incidentally, that 

k42 


e=- 


, 


Fig. 40 i.e., the vibrational energy is propor- 
tional to the square of the amplitude. 
For the three springs considered in the example on p. 89 assuming the 


amplitudes of the oscillations are the same, i.e., A = 0.4 cm, the total vibra- 
tional energy will have, respectively, the following values: 


e,=0.49X105 ergs; e2=0.49Xx104 ergs; e,= 49 ergs. 


This discussion has not taken into account the frictional force, 
which, as a rule, is experienced by all vibrating bodies. Such ideal 
vibrations will continue for ever without change in amplitude. 
Friction, however, produces damped vibrations. Formally, in this 
case too, it is possible to write the displacement equation in the form 


z=Acoswt, 


but A is understood to decrease with time (Fig. 40). To determine 
how A depends on time, the frictional force must be known, i.e., 
fir must be known for every instant of time during which vibrations 
occur. A simplifying assumption, more or less satisfied in practice, 
is that the frictional force is proportional to the velocity of motion: 


frau, 
where the coefficient a is known as the resistance constant. 


For a ball having a radius of 0.53 mm, the resistance constant œ at about 


15°C is 13.93 gm/sec in glycerine, 0.35 gm/sec in sulphuric acid and 0.01 gm/sec 
in water. 
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The energy equation can now be written in the form 
de = —av dx 


and the vibrating particle continuously loses an amount of energy 
equal to the work of the resisting force. Hence, the equation of 
motion is written as follows: 


ma= —kx—av. 


2 


By substitution, it is not difficult to show that this equation is 
satisfied by the equation x = A cos œt when the amplitude A de- 


creases exponentially with time: 
a 


A=Aje am 


Here, Ag is the amplitude at the instant of time t = 0. 
Tt should be noted that the ratio of two successive amplitudes 
is a constant. Thus, the expressions for the amplitude after n—1 


and n periods, respectively, are 


a a 
— =~ (n—-1)T — -nT 
An = Apem andA,=Aye 2" . 


Let us divide the former relation by the latter. The ratio : 
Hra “om. 
er m 


does not, in fact, depend on 7. The rate of damping is sometimes 
expressed by the logarithmic decrement ô: 


ES Ana i 
On a =z i 


Thus, the damping is greater, the greater the resistance constant, 
the smaller the mass, and the greater the period. 

It should be noted that the period of damped vibrations differs 
from the period of free vibrations. The same calculation that leads 
to the formula for the time dependence of the amplitude also yields 


the following relation for the period: 


1 
DS fae 
Ve 


that, for small resistance, T differs little from To = 


This means 
= 2n 1/ 7. When the resistance increases the period increases and, 


finally, for 
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vibrations cease. We say, in this case, that the body displaced from 
the equilibrium position returns aperiodically to this position. 

Here are some approximate values for the logarithmic damping 
decrement of certain vibrating systems: 


Acoustical vibrating systems .... . 0.1 
Electrical oscillation circuits . . . . 0.02-0.05 
NOOTE seep eat Seis, uae a 10-3 
N OTELE aa Sonn TN E 10-74-10-75 


Let us consider several examples of damped vibrations. r 
ʻa) Vibrating tuning fork. Logarithmic decrement b= T= 40-8, 


Assume that the period of the vibrating tuning fork is 7 = 0.01 sec. Then, 
= 0.4 sec4, This means that during the time 2m = 10 sec the amplitude 


of the vibrations decreases by the factor e: 
a 


- t 
Ar=Ape ?™ ; At=10= Aven. 
The quantity oh = q is called the time constant of the given vibrating system. 


b) In acoustical vibrating systems, as can be seen from the above table, 
the logarithmic damping decrement is large. This means that the vibrations 


are rapidly damped. If 6 = = T =0.1, the amplitude of the tenth vibration, 


Ato, will already be less than the initial amplitude Ay by the factor e. 
us, - 


1x10 
Ao Ar Ago _ fam A ioe 
Ay Az ``’ Ay Aso Piet? Ai 


c) The change in the period of damped vibrations may be conveniently illus- 
trated by means of a spring. Let a load having a mass m= 50 gm be suspend- 
ed from a steel spring, which is thereby elongated by 2 cm. Thus, the stiffness 
of the spring is k = 24,500 dynes/cm. If there were no damping, 


m 


To=2n p= 0-28 sec. 


Assume that the damping is such that the time constant 1L= pli 5 sec, 


i.e., the resistance constant œ = 2 i i i 
ears 0 gm/sec. The period of the vibrations then 


F j s 
m= = =~ To (1+4. 08 x 10-5), 
He 


Let us now immerse this pendulum in liquid. Assumi i i 
J 3 P uming the t t in 
tis case to be Tə = 1 sec, the amplitude of the fourth Parano wil ARIY 
e 1/e of the initial amplitude, i.e., there is considerable damping: 


Ta ~ To (14-102 x 10-8) ~ 1.00179. 
Thus, even in this case the period increases by only 0.1 per cent. 
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27. Forced Vibrations 


If a body is displaced from its equilibrium position and then 
not interfered with, the vibrations occur at the natural frequency 
of the body, independent of the nature of the excitation, i.e., the 
vibrations are determined only by the properties of the system. 
The frequency of the vibrations of a string remain the same regard- 
less whether the sound was made by the string being plucked or 
struck. 

At the same time, a number of means exist of “locking” the vibra- 
tions of a body to an external frequency. Such forced vibrations 


aw 
@ 
Fig. 41 Fig. 42 


may take place if two bodies capable of vibrating are coupled. One 
of the bodies will force the other to vibrate. A motor that is improp- 
erly balanced will execute vibrations that are transmitted to the 
foundation, i.e., the foundation will execute forced vibrations. 
We can perform the following experiment: A pocket watch is placed 
in a small box and suspended by three strings. As a result, the box 
passes into a state of forced vibration. In Fig. 44, a device is shown in 
which a rotating eccentric makes a pendulum pass into a state of 
forced vibration. In all these cases, a periodic force varying with 
some frequency œ acts on a body. Such a force is aptly called an 
external force. 

Forced vibrations do not set in immediately. A certain amount. 
of time must elapse before the body coupled to the vibrating system 
begins to vibrate. Eventually, a particular amplitude is reached 
and the frequency of the vibrations will be exactly equal to œ. 

The fact that a body has a natural frequency of vibration po. 
nevertheless affects the phenomenon of forced vibrations. To be 
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more exact, as we shall directly see, the natural frequency and 
the external frequency differ significantly. Fig. 42 shows the 


r W 
dependence of the amplitude of forced vibrations on the ratio EFI 


0 
for three systems having different amounts of friction. When the 
external frequency and the natural frequency coincide, the amplitude 
of the forced vibrations is a maximum. This phenomenon is widely 
known as resonance. r 

The curves shown in Fig. 42 may be determined theoretically. 
The equation of motion of a body executing forced vibrations, 
under the action of a periodic external force Fo cos wt, has the form 


ma = — kz — av -+ Fo cos ot. 


By substitution, one can easily show that the displacement of 
a vibrating point will satisfy the equation 


z= Acos (wt +f), 
where the amplitude 


ee 
Vm (oF — 022+ a0? 
and the phase shift B satisfies the equation Ths 
__ =o Ake 
tan ĵ = m (0—0?) * i 
b : ae € dz 
Taking into account that a = di and v = ar let us substitute these 


values in the equation of motion. After sim 
containing cos wé and sin wt, we obtain 


[(—mo?-+ k) A cos B—aoA sin B — Fo] cos wt — 


ple conversion and grouping terms 


—[(—mo?-- k) A sin B-+a@4A cos f] sin of =0. 


Since the obtained equation must be valid for every instant of time, the coeffi- 


cients of cos @¢ and sin w¢ must be equal to zero. Thus, we obtain two equa~ 
tions for determining A and p: 


[(—mo?+- k) cos B—aw sin B] A = Fo 
[(—mo?- k) sin B -+o cos p] A=0. 
Squaring both equations and adding, we obtain 
; = A 
Vo oTe 


ies 
where Og = V2 is the frequency of the natural vibrations. From the second 
equation, we obtain the phase shift p: 


— awn 
aoo 
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From the first formula it follows that the amplitude A depends 
on w as follows: When œ<% the amplitude increases as © increases; 
when @ = @) the amplitude reaches a maximum; and when 
© > oo the amplitude decreases as © increases. This effect (sharp- 
ness of resonance) is more pronounced, the smaller the resistance 
constant a. When there is little friction, the resonance disrupts 
the system, for at a=0 the resonance amplitude goes to infinity. 
Engineers must take this into account in their calculations. 
To design a structure so that it is in- 
sensitive to the vibrations of its foun- 
dation, a resonance curve similar to 
the one shown in Fig. 43 must be 
available. The lower curve shows the 
vibrations of the foundation and the 
upper one, of the structure. At reso- 
nance, which occurs when the period 
of the vibrations is 0.32 sec, the amp- 
litudes reach a value of 20-25 microns. 
This, in general, is no small amount. 

The sharpness of resonance is an in- 
dicator of still another important 
phenomenon, namely, the sharper the 
resonance, the slower vibrations of 
constant amplitude set in. 

Another feature of forced vibrations 


Upu 


Amplitude 
ENF 


D 
IS 


is the presence of phase shift. Until 0 
now, we have assumed that the origin 030 040 
of the coordinate system was so select- Period (sec) 


ed that with respect to t=0 the maxi- 
mum displacement is in the positive 
direction. Naturally, if we are consid- ] 
ering only one vibration, there is no need to select any other ori- 
gin. However, if we are comparing two vibrations and pick the ori- 
gin so that z = A when ¢ = 0, then the displacement of the other 
vibration at this particular instant may have an arbitrary value. 
This circumstance may be taken into consideration by introducing 
the phase shift 6 in the argument of the cosine. Thus, if x = 
= Acos (wt + B), then z=A cos f at the instant of time <=0. The 
phase displacement is uniquely described by means of the phase 
shift p. 

eat by now return to resonance phenomena. The quantity B in 
the formula for forced vibration indicates that the phase of the 
forced vibration, generally speaking, is shifted with respect to the 
phase of the impressed vibration. The magnitude of the phase 


Fig. 43 
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shift depends on 2 , the ratio of the natural frequency to the exter- 


nal frequency, and also on the damping. Fig. 44 shows that a 90°- 
phase shift occurs at the resonance frequency, independent of the 
damping. The effect of the damping becomes clear when the situa- 


-8 


Phase LAG 
NI 


a i 
Fig. 44 


_tion somewhat removed from the resonance condition is considered. 
For weak damping (small logarithmic decrement 6), at frequencies 


Fig. 45 


somewhat below resonance, the phase shift is almost zero, while 
at frequencies somewhat above resonance, the phase shift is almost 
180°. The same tendency exists for heavy damping, but it is not 
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so pronounced. For a small amount of friction, one can say that 
a 180°-phase shift occurs when the frequency passes through the 
resonance condition. : 
A simple experiment (Fig. 45) will demonstrate the essence of 
| these interesting relationships. Suspend a weight by a string and 
allow it to swing freely. When the period of the free vibrations of 
this pendulum is manifested, stop the pendulum and by periodic 
motion of the hand bring it into a state of forced vibration. 
| At first move the hand rapidly; so that the period of the natural vibra- 
tions is greater than the period of the forced vibrations; then move 
it slowly, so that the period of the natural vibrations is less than 
the period of the forced vibrations. It will be seen that in the first 
case the pendulum and the hand are 180° out of phase, while in 
the second case they are in’ phase. 
; 


| Let us again consider thespring on page 94, which has a damping constant 
a = 20 gm/sec, a mass m = 50 gm and a period To= 0.28 sec (Oo = 
| = 22.4 sec). When an external sinusoidal force of frequency © = @o acts 
on the spring, the amplitude of the forced vibrations is equal to A = 4 cm 
| when the amplitude of the im pressed force is Fo = 1,790 dynes ~1.8 gm. 
| A deviation of the frequency of the impressed force from @o results in a change 
| of the amplitude of the forced vibrations and a change in the phase shift B 
| between the vibrations of the spring and the external force. The table shows 
the data obtained for various deviations by means of the formulas derived 


| in this section. 


| 
l Freq. of orterna] force Aupiitnde’o i Tore Phase angle — f (degrees) 
2 3.58 0°05’ 
10 3.95 0°35’ 
15 4.48 0°45” 
22.4 4.04 90° 
30 2.48 178°50’ 
40 1.31 479°40* 


It is seen that in the presence of damping the maximum amplitude of the forced 
vibrations is reached when the frequency of the impressed force is some- 
what less than the natural frequency of the vibrations. The weaker the damp- 


ing, the smaller this shift in frequency. 


28. Seli-Sustained Vibrations 


Fig. 46a shows a trough of triangular cross-section fixed on 
a shaft about which it can rotate. The trough has some particular 
period of free oscillations, which may be observed by swinging the 
trough away from its equilibrium position. The oscillations will 
continue as long as friction and air resistance do not stop them. 


7* 
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Let us place the trough under a water faucet and allow the stream 
of water to flow evenly on the wall of the trough, at a point some- 
what removed from the centre line. It is not difficult to envisage 
what will ensue. As more and more water pours into the trough, 
the height of the centre of gravity rises until, finally, it exceeds 
the height of the shaft to which the trough is fixed. The pressure 
of the stream of water is now sufficient to upset the trough; where- 
upon, water flows out and the 
trough returns to its original 
position. This cycle keeps re- 
peating as long as the stream 
of water continues to flow. 
Thus, the trough will oscil- 
late. However, the character 
of the oscillations produced in 
this manner is quite different 
from the oscillations consid- 
ered above. 

In the first place, it is im- 
portant to note that the exter- 
nal force is not of an oscillatory 


$ nature; i.e., it is a constant 
= force (the pressure of a stream — 

= of water). Secondly, such a 

S system executes un ped 

€ Time —> oscillations, although 10 

b) to the action of friction and 

other resistance. And finally, 

Fig. 46 the resulting oscillations are 


not harmonic, i.e., they do not 

have a sinusoidal shape. Thus, 
in our example, the similarity to a sinusoid is nil. By conducting 
such an experiment, it can be shown that the dependence of the 
amount of water in the trough on the time may be represented by a 
saw-toothed curve similar to that shown in Fig. 46b. 

The oscillations described above may be classified as self-sustained 
oscillations. Such oscillations constitute a distinct phenomenon, 
basically differing from free, undamped oscillations occurring with- 
out the action of a force, as well as from forced oscillations occurring 
under the action of a periodic force. The above example may appear 
to be artificial. However, self-sustained systems have broad 
application and are very often encountered wherever mechanical 
and other oscillations occur. 

A simple pendulum clock (Fig. 47) executes self-sustained oscil- 
lations. As is well known, such a clock is actuated by a falling 
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weight suspended from a chain that passes over a gear wheel. This 
wheel is located on the same axis as a balance wheel, which can 
mesh with a symmetrical anchor escapement. A pendulum is rigid- 
ly fixed to the escapement. At the instants when the balance wheel, 
which is driven by the gear wheel via a gear drive, touches the 
pallets of the escapement with its teeth, the pendulum is given 
an impulse. The rest of the time the pendulum and the escapement 
swing freely, while the balance wheel moves by itself. The escape- 
ment and the balance wheel are so constructed that the pendulum 
obtains two impulses each time the 
balance wheel advances by one 
tooth. One impulse is obtained when 
the pendulum moves from left to 
right, and the other when it moves 
from right to left. 

The self-sustained vibrations of 
a clock are basically similar to 
those of the trough of triangular 
cross-section. The vibrations occur 
under the action of a constant rather 
than a periodic force, are undamped 
in spite of the presence of friction, Fig. 47 
and are not harmonic. 

In the above examples, a common property of self-sustained 
vibrations is manifested, namely, the property known as feedback. 
A pendulum executes undamped oscillations and causes a mechanism 
to give an impulse at appropriate moments. The mechanism pushes 
the pendulum and the pendulum provides feedback to the mechanism. 
If the pendulum stops, the impulses also cease. The oscillations of 
the pendulum are governed by the pendulum itself. 

In exactly the same manner, the swings of the triangular trough 
are governed by the trough. The stream of water regulates the 
swinging of the trough, while the construction of the trough itself 
regulates the water inflow. \ 

A string struck with the fingers and then released is in a state of 
free vibration. The situation is different when a string is drawn 
with a bow. In this case, the string executes self-sustained vibra- 
tions that are saw-toothed in shape. The bow pulls the string along. 
When the displacement reaches a certain limit, the string separates 
away from the bow, returning to its original position. The bow 
again pulls the string along and the process is repeated. In the 
space of the second that the musician draws the bow, the phenom- 
enon is repeated hundreds of times. These are typical self-sustained 
vibrations since they are due to a continuously acting force. The 
string itself controls the vibrations by its elasticity. 
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The squeaking sounds emanating from door hinges in need of 
oiling also belong to this class of vibrations. : 

We say that feedback occurs whenever an instrument or machine 
automatically introduces automatic corrections to its action when 
the operating conditions change. The principle of feedback is one 
of the fundamental concepts in automation. 


29. Addition `of Parallel Vibrations 


In a number of cases, the problem arises of analysing the motion 
of a body simultaneously executing two vibrational motions. Thus, 
an oscillating pendulum may 
be located on a vibrating plat- 
form, or it may be on a roll- 
ing ship. 

If we are concerned with 
vibrations in a single direc- 
tion, the addition occurs as 
shown in the model in Fig. 48. 
Two pendulums, in this case, 


oscillate in paralle planes. 
A light rod lies freely on the 
pendulums and a recording 


pen is attached at its centre. 
As an approximation, we can 
assume that the pen will re- 
main ina plane differing little from the planes of vibration of the pendu- 
lums and that the displacement of the pen at a given instant will 
be equal to the algebraic sum of the pendulum displacements. Another 
arrangement may also be used, e.g., a ball oscillates on a spring 
suspended from a board and the board, in turn, is attached to a post 
by a spring in such a manner that the ball simultaneously executes 
two different vibrations in a single plane. 

If z, is the displacement of the first vibration in the absence 
of the second, and z, the displacement of the second vibration in 
the absence of the first, then, at each instant, for simultaneously 
occurring vibrational processes, 


Fig. 48 


T= T4 F Tz. 


In the most general case, the component vibrations may differ 
in amplitude, frequency and phase. 

Let us first consider the case when the vibrations have equal 
amplitudes and frequencies, but are displaced in phase. Then, 


zı =Á cosot, z, = A cos (wt + ọ) 


—— SS 
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and 


T= t4 + Tz = 2A cos cos (+5) À 


2a . . . . 
where o=. This means that the resultant vibration is also 


harmonic and has the amplitude 


2A cost 3 
Hence, it follows that the amplitudes of vibrations add arithmeti- 
cally when the vibrations coincide in phase and subtract when 
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Fig. 49 


they are opposite in. phase (p = 180°). In the intermediate cases, 
the amplitude assumes a value between zero and 2A. In particu- 
lar, when p= 120° the amplitude of the resultant vibration is 


equal to A. This is illustrated in Fig. 49. 
Another important case is the addition of vibrations having 


different frequencies. For simplicity, let us assume that p = 0 and 
the amplitudes are equal. Then, 
zı = Acos wt, X,—=Acos@t, and 


z= 2Acos on Oe t cos oe th 


the vibrational motion obtained when such 


In the general case, al n ined wh 
does not exhibit a distinct periodicity with 


vibrations are added 
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respect to the displacement z. However, two particular cases de- 
serve special consideration. : 

First, let us consider two vibrations whose frequencies are close 
to each other. Then, o, — @2 < @; + @, and the displacement z is 
the product of two cosines, one varying rapidly with time and the 
other very slowly. Hence, 

2A Cogs ee 
may be considered to be the slowly varying amplitude of vibra- 
-- Wo 
tions occurring with an average frequency Oar = e, The fre- 


quency of the slowly varying amplitude is known as the beat 
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Fig. 50 


frequency. Fig. 50 clearly shows the two frequencies—the basic 
frequency of the vibrations and the beat frequency. 


The second important case is the addition of two vibrations 
when one of the frequencies is a multiple of the other. Tt is quite 


Fig. 51 


evident that the resultant vibration will be periodic. If, for example, 


the period of one vibration is 3 sec and that of the other 7 sec, the 


resultant vibration will be repeated every 24 sec. This is shown 
in Fig. 54. 


30. Vibration Spectrum 


We have already spoken about vibrations that repeat with preci- 
sion every specific interval of time, but are not harmonic. For 
example, we have considered saw-toothed vibrations. If we are 
sufficiently exacting, it turns out that harmonic vibrations, i.e., 
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those represented by sinusoids, are encountered in’ nature and 
engineering much less often than nonharmonic vibrations. 

At the end of the previous article, we noted that the sum of two 
sinusoids is a periodic vibration, even though not a sinusoid, if 
one of the frequencies is a multiple of the other. Naturally, this is 
true for any number of harmonic vibrations and not merely for two. 


& Ş 3 A 1 
The sum of two vibrations having periods T and = T, respec- 


tively, is a vibration having a period 7. Furthermore, this is the 
period of the vibrations obtained by adding three vibrations having 


periods T T and + T, respectively; also, four vibrations—with 
the additional vibration having the period ir; five vibrations— 


with the additional vibration having the period + T; etc. Converting 


to frequencies, this may be expressed as follows: The sum of any 
number of vibrations whose frequencies are multiples of œ, i.e., the 
frequencies œ, 2@, 30, ..., is a vibration having the frequency ©. 
Now, the following question naturally arises: By. adding an 
arbitrarily large number of vibrations, whose frequencies are mul- 
tiples of œ and whose amplitudes are selected as required, is it 
not always possible to secure any desired vibration, even saw-toothed? 
the French mathematician, proved that this was 
indeed the case. The theorem named after him states that it is always 
possible to select a, @2, @3, ++ and P1, P2) Ps, ++. such a man- 
ner that any periodic vibration having the frequency @ may be 
represented in the form of a sum of harmonic vibrations: 


x = a; cos (@t + 4) + 42 Cos (20t + P2) + as COS (8@t+ p) +.. 


; i lled the fundamental frequency, and the fre- 
ee sae the overtones or harmonics (e.g., second 
harmonic, third harmonic, etc.). The closer the curve of the vibra- 
tions approaches a sinusoid, the smaller the amplitude of the harmon- 
ics. On the other hand, if the curve of the vibrations hardly resem- 
bles a sinusoid, the amplitudes of some of the harmonics will not 
differ greatly from the amplitude of the fundamental frequency. 

Representing the vibration in the form of a sum of harmonic 
vibrations is called spectrum analysis. The Petre ar of the 
data on the frequencies and amplitudes of the harmonic vibrations 


SY % A P These data may be pre- 
compr: the vibration of frequency ©. à iN 
sented in tabular form. If there are many frequencies, how ever, we 
usually resort to a graphical means of presentation (Fig. 52). j 

y þe extended to include nonperiodic 


Ai ; ectrum may , . . 
dea pe ae speak of the spectrum Of clastic: yibrations 


Fourier, 
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produced by a blow on a table, the spectrum of the report of a gun, 
or the spectrum of an outcry. 

To make this clear, let us first consider a process consisting of 
periodic, damped impulses. This is not the case of a single report 
or outcry, but of a series of reports or outcries, i.e., repeated at 
regular intervals of time. Characteristic of this process is the rapid 
damping of such a vibration, whose curve is shown in Fig. 53a. 
The spectrum of this vibration may be established by existing 
means and has the form shown on the right in the figure. As was to 
be expected, we see that the spectrum is composed of frequencies 
that are multiples of the fundamental. It should be noted that the 
spectrum has a maximum, which occurs for the eighth harmonic. 
This is not accidental, for if we examine the vibrations depicted 
on the left in the figure, we see that within each individual impulse 
the damped pulse vibrates at a “frequency” that is 8 times greater 
than the frequency of the fundamental tone (Fig. 53a). 

Similar impulses are illustrated in Fig. 53b, but in this case 
the frequency is one-half of the above. Compare the spectrum of 
this vibration with the previous one. Since the fundamental fre- 
quency is now one-half of the original, the “frequency” of the damped, 
elementary process (we have assumed that it has remained the same) 
will now be the 16th harmonic of the fundamental tone. The distri- 
bution of the harmonic amplitudes remains as before, but their 
number in the same interval of frequencies is two times greater. 

It is easy to see now that the spectrum of a nonperiodic proc- 
ess—a single impulse—is continuous. Individual frequencies are 
not discernible (Fig. 53c), but the nature of the spectrum is very 
similar to that considered above. 


A mathematical proof of the above conclusions is contained in 
the theory of Fourier integrals. 


31. Addition of Mutually Perpendicular Vibrations 


To analyse a complex vibration consisting of the sum of two 
mutually perpendicular vibrations, it is best to use an electronic 
oscilloscope. We shall discuss this apparatus in more detail below 
(p. 462). For the present, it is sufficient to note that an oscillo- 
scope enables us to depict the vibrations of an electron beam in two 
mutually perpendicular directions. The trace of an electron beam 
on a fluorescent screen describes a path that is the result of two 
mutually perpendicular vibrational motions of the beam spot. 

Let us assume that the vibration of the beam trace in the ver- 
tical direction is represented by the relation y = b cos (wt+ ô), and 
in the horizontal direction by the relation z = a cos wt. To deter- 
mine the nature of the resultant path, we must eliminate time from 
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the above equations and obtain an equation of the form f(z, y) = 0- 
Writing the expressions-for the displacements in the form == 
= cos ot, + = cos (@t+-6) = cos wt cos 6—sin ot sin 6 and 


replacing, in the second equation, cos wt by = and sin wt by 


EE „we obtain after simple conversion the equation of 
a 


an ellipse rotated with respect to the coordinate axes: 
z% o y2 2ry 


sin? 
aa p <p cos sin? 6. 


Let us now vary the parameters of the vibrations and see what 
happens to the ellipse. If we vary the phase difference, the ellipse 
will change its form and simultaneously rotate (Fig. 54) 


y y y 
d=0° o=90° ô=180° 
Fig. 54 d 


When the phase difference is equal to 90° the 


axes of the ellipse 
coincide with the coordinate 


axes. If the phase difference is decreased 

or _ increased, the ellipse 

Va aes begins to rotate to the left 

or to the right, respectively, 

and simultaneously contracts. 

When the phase difference is 

reduced to zero, the ellipse 

degenerates into a straight 

line. The various cases can 

a be checked by substituting, 

in turn, the values ô = 05 

90° and 180° in the above 
equation. 

If the amplitudes of the vibrations in the vertical and horizon- 

tal directions are equal, then for phase differences of 90° and 270° 

the path is a circle. There is a difference between these two phase 

differences in spite of the fact that the paths are identical. In one 

case, the beam moves around the circle in a clockwise direction, 


sA 
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while in the other, the motion is counterclockwise. To see this, 
let us return to the original equations. We obtain the following: 


for 90° x=acosot, y=bcos (ot + 90°); 
for 270° x=acosot, y= bcos (ot -+ 270°). 


The first pair of equations shows that for increasing time, at t = 0, 
the point having the coordinates x = a, y = 0 begins moving in the 
direction of negative y, i.e., clockwise. The second pair of equations 
shows that in this case the direction of motion is counterclockwise. 

In viewing an oscillogram, it will be observed that the ellipses 
do not stand still, but slowly shift as though a continuous change 
in phase were occurring. Careful observation shows that the ellipse 
does not rotate, but the curve being traced by the beam spot seems 
to continuously shift from one ellipse to another. This phenomenon 
occurs when the frequencies of the vibrations differ somewhat. In 
fact, a difference in frequency is entirely equivalent to a continu- 
ously changing phase difference. Let us assume that the frequency 
of the vertical vibration œz is Aw greater than the frequency of 
the horizontal vibration @,. Then, 


ot + 6 = ot + (Aot + ô), 


where the variable phase difference is within the brackets. 

If the frequencies differ considerably from each other, before the 
beam is able to describe the major portion of one ellipse the phase 
has already changed. As a result, the described curves look less 
and less like ellipses. Examples of these queer curves are shown in 
Fig. 55 and are known as Lissajous figures. The depicted curves 
are for a frequency ratio of 3 : 4. 


CHAPTER VI 
TRAVELLING WAVES 


32. Propagation of a Disturbance 


Every body is elastic to one or another degree, i.e., every body 
is able to restore itself to its original form after being distorted 
by a force of short duration. This property is responsible for the 
fact that every mechanical action is transmitted by a body with 
finite velocity. If a perfectly rigid rod, incapable of being deformed, 
existed, it would only be able to move as a unit, and the action 


Fig. 56 


of a force would dissipate in such a body instantaneously. If a per- 
fectly plastic body existed, deforming without in the least restoring 
itself to its original form, it would be incapable of transmitting 
any mechanical action whatsoever. 

In an elastic body a disturbance is transmitted successively 
from one particle of a body to a contiguous one. The compression 
produced at the end of a rod due to the blow of a hammer is prop- 
agated along the body with a definite velocity c. If a bend of short 
duration is created at some point of a rigid body, this disturbance 
will also be transmitted with finite velocity through the body. The 
same is true of every deformation. Propagation through a body 


for various mechanical deformations is 
usuall a 
eae of a spring (Fig. 56). Papen Ap oY 
Elasticity of compression and extension is a iqui 
i roperty of liquid 
and gaseous bodies, as well as of rigid DOdicsertiants take. ie: 


turbances may be transmitted in all bodies. However, the disturb- 


ances produced by shearing, torsional and bending deformations 


ai eee only by rigid bodies possessing the corresponding 
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For compression and extension, the motion of the particles is 
in the direction in which the mechanical action is transmitted. 
Hence, the propagation of the disturbance is said to be longitudinal. 
In the case of shearing, torsion and bending, the direction of motion 
of the particles may form, generally speaking, any arbitrary angle 
with the direction in which the energy is transmitted. 

We can always select the direction in which the mechanical 
action is being transmitted and then resolve the displacement of the 
particles of the body into three mutually perpendicular components, 
whereby one of the components is in the direction of propagation 
and the other two are in a perpendicular plane. Thus, in the most 
complex case, we can consider the propagation of a disturbance as 
consisting of the sum of three motions—one longitudinal and two 
lateral. 

The velocity of propagation of the disturbance due to an elastic 
deformation depends on the mechanical properties of the body. 
As is shown in theoretical physics, this velocity is related to other 
physical constants of the body. Thus, for longitudinal waves, it 
is given by the simple formula: 
pat 

Vox ` 
Here, p is the density of the body and x is the compressibility. 
A large density leads to an increase in the inertia of the body’s 
particles and, consequently, the velocity of propagation of the 
elastic waves decreases. The small values for compressibility indi- 
cate that even large elastic forces correspond to only small deforma- 
tions. The smaller the compressibility, the greater the velocity of 
propagation of the disturbance. 

This is the form in which this formula is generally used for liquids. 
Thus, water compresses by 5 X 10-° of its volume for a change in 
pressure of 1 atm. This means that the compressibility, equal by 
definition to , 

mta -5 cm? 
oS 55 (sve p. 157), is 10-° apie x5 x 10-5. 
The density of water is 1 gm/cm*. Hence, for the velocity of prop- 
agation in water, we obtain , 
ca —=2 x 101 cm?/sec?, 
c= 1,400 m/sec. 

For gases, it is convenient to convert the formula for velocity 
into another form. Since the process of transmitting compression 
in a gas is very rapid, the compression and expansion of a gas may 
be considered to occur adiabatically, i.e., without heat exchange. 


c 
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We shall derive the equation of the adiabatic process below (p. 170), 
from which it is easy to obtain the following relationship between 
the coefficient of compressibility and the pressure of the gas: x = 


=+ | where yw 1.4*. Thus, c= V = .For an ideal gas, the density 
yp? i p 


p= Eis proportional to the fraction E, where p is the mass of 
v 


r . . v 
a mole of gas and v is its volume. This is so because T = const. 
Hence the velocity of propagation in a gas is 


c= yat. 


Here, a is a constant whose value is easily calculated by means 
of equations considered below (p. 170). 

Thus, the velocity of propagation of the disturbance due to a defor- 
mation in a gas, including the velocity of propagation of sound 
waves, which will be discussed in more detail later on, is proportion- 
al to the square root of the temperature and does not depend on 
the pressure of the gas. The dependence on the molecular weight is 
interesting. In hydrogen, the velocity of propagation is equal to 
1,263 m/sec, while in air, as is well known, it is 334 m/sec. 

For longitudinal waves propagating in a rigid body, the coeffi- 
cient of compressibility is usually replaced by the modulus of 
elasticity. Since, by definition, the modulus of elasticity 

pat. nA 


Sr ea 


it is evident that in the absence of transverse motion wat, for 
the linear compression is equivalent to the volume compression. 


The formula for velocity may then be written as follows: 

Z 

a 

* The equation of the adiabatic process is pv’ = const. If p and v are the 


equilibrium values of the pressure and volume for a certain mass of gas, and 
pt Ap and v — Av the corresponding values at the instant of deformation, 
hen 


(p-++Ap) (v— Av)’ = pv’. 


whente dA (1 a FY Si e a (2 ce 


1x2 v 


Dis- 
regarding terms of higher order in the binomial expression, we obtain 


Ap= — yp ae . Hence, Yale i 
v > vp 
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The table shows the extent of the agreement between calculated 
and experimental values: 


"x D y. alc. xp’ 
Young’s Modulus E as Ese $ E | $ Go 
Glass: . « « . 2 = ome 7.8x105 kg/cm? 2.4 5,700 5,990 
Steal’... kedy whe 2 TE 2,200 kg/mm? s 8 5,200 5,000 
Wood! oaaae A 7.2105 kg/cm? 0.7 4,130 4,200 
Water (13°C) .---- %=4.75x 1075 atm! 1 1,450 1,440 


To check the formula for the velocity of propagation of sound, it 
is necessary to use samples in the shape of slim rods. This is due 
to the fact that a more thorough analysis of the problem indicates 


that the formula c=V A is only valid for such bodies. For bodies 


having other shapes, as well as for the propagation of sound in 
a continuous medium, theory leads to expressions which we shall 
not introduce. 

It should also be noted that the values given in the table are 
only for guiding purposes. The velocity of sound in different types 
of glass, wood, steel, etc., differ considerably. 


33. Generation of Wave Motion 


Sustained vibrations may be applied to a particular point of 
a body or medium in a variety of ways. A force acting periodically 
at some point of a body produces a periodically varying deforma- 
tion whose disturbance is transmitted at a specific velocity from 
one point of the body to another. All the particles of the body 
the vibratory motion. Since the velocity of prop- 
however, the particles of the body are set into 
vibration in consecutive order. If a body is infinitely large, such 
a vibration will advance continuously, forming a travelling wave. 

Infinitely large podies do not exist. However, the actual length 

does not affect the nature of the phenomenon, for 


of a large body Apt es Ge os 
fie G E do not reach the end in view of inevitable ener- 


Ur consider a wave travelling in a particular direction in 
a body that for all practical purposes is infinitely large. Assume 
that the particle located at the origin of the coordinate system is 
vibrating in accordance with the equation y = A cos ot. Let us 
write the equation of vibration for a particle located along the 
line of propagation of the disturbance at a distance x from the ori- 


8—1409 


participate in 
agation is finite, 
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gin. It is not the same as for the particle at the origin because this 
particleSbegan to vibrate after a delay of t=, the time required 
for the disturbance to be propagated the distance x. The vibration 
of particle z, therefore, is shifted in phase with respect to the vibra- 
tion of the particle at the origin. At the instant of time ¢, the vibra- 
tion of particle x will have the same phase as the vibration of the 


particle at the origin at an instant of time Z earlier. Hence, the 
equation of vibration for a particle displaced by a distance x from 
the origin is 


y=Acoso («-=) j 


where "i is the phase shift. 


The above equation is known as the wave equation. It is valid 
for the vibration of any particle located at any distance from the 
origin. 

Let us assume that the source of the wave is far from the observ- 
er and that the wave front has long since moved ahead. We now 


—> Motion of the wave 


Motion of the wave-<— 


Fig. 57 


consider a portion of the line along the z-axis subject to wave 
motion. At first glance, the introduction of 
unjustified. To be sure, vibrations occur at 
However, can we discern how the wave i 
to theright orto the left? Careful observation shows th 
character of the wave motion is easily detected. 


ft. In the reverse 


ceeding instant, this sinusoid is displaced in its entirety in the 
direction in which the energy is being transmitted. 
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It is clear from this that thef the wav direction oe motion affects 
the form of the wave equation. If the wave moves along the coordi- 
nate axis in the positive direction, a minus sign must precede the 
value of the coordinate x. If the wave moves along the coordinate 
axis in the negative direction, the sign in the argument of the 
cosine must be reversed: 


Tt zx 
y=Acoso(¢—=); y=Acoso (t-+—) : 
pos. direct. neg. direct. 


The wave equation at an instant of time equal to a multiple of 
the period reduces to 
= ee DEA 
y=Aco0s0 A cos 2n -7 . 
The minus sign is dropped since the cosine is an even function. 
From this equation, it immediately follows that the period of the 


sinusoid is 
TEE HA 


This spatial period, i.e., the distance covered by an undulation 
before it repeats itself, is known as a wavelength. We have thus 
arrived at the well-known relation connecting the velocity of a wave, 
its length, and the period of a vibrating particle. ‘ 

A number of physical quantities vary sinusoidally in the undula- 
tory transmission of a disturbance through a body, namely, the 
displacement of a particle from its equilibrium position, the velocity 
of the vibrating particles, the pressure and the density. Hence, the 
above wave equation is very general. The quantity y may designate 
any of the enumerated physical quantities, which vary sinusoidally 
when the wave moves in the z-direction. It should be noted, of 
course, that the waves of pressure, velocity and displacement do not 
necessarily have to coincide in phase. For example, it is clear that 
the wave representing the velocity of the vibrating particles is shift- 
ed in phase by 90° with respect to the wave of the displacements, 
for the velocity of a particle is a maximum when it passes through 
the equilibrium position. 


34. Pressure and Velocity of Vibrations 


It is interesting to examine the relationships between the wave 
amplitudes of various physical quantities. For this purpose, we 
shall only consider longitudinal waves propagating in a gas and 
concern ourselves with waves of displacement, particle velocity and 
incremental pressure. Since the theory arose in connection with 
auditory waves, the incremental pressure Ap is often called the 

8* 
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sonic pressure and is designated by p, the symbol A being dropped. 
If A is the amplitude of the displacement wave, then œA is the 

amplitude of the velocity wave. These two waves are 90° out of 
hase. 

P We shall now derive the relationship between the amplitude of 

the velocity of vibrations and the amplitude of the pressure. From 

the general definition of x as applied to gases (p. 112), we obtain for 

sonic pressure the formula 


Av 
Be Ne 


where P is the pressure of the gas. Using the relation c? =¥~, we 
obtain 


It is perfectly natural that a direct connection should exist between 
the incremental pressure p and the relative compression in the gas 
at the same location. 

However, going a step further, the relative compression of the 


volume, © » can be related to the displacement amplitude of the 


vibrating particles. Let us mark two points, x, and 2», slots the 
line of propagation. For a longitudinal wave, changes in density 
occur as a result of displacements in the direction of propagation, 
Let us considera volume of gas bounded by the cross-sections through 
zı and z. When the wave moves, the molecules within this volume 
are displaced. However, it is only necessary to consider the situation 
at the limiting cross-sections. If the molecules of the layer through a 


are displaced by y; = A cos œ (¢ — 2) and the molecules of the 


layer through z, by Yz = A cos œ (: — z2), then the linear dimen- 
sion of the volume, £, — a, changes by the amount Y2 — yı. The 
relative change in length, 


and hence in volume, is @=" Going 


. . . to 04 
over to the limit, in order to obtain a quantity descriptive of a point 
in space, we obtain 
Av _ dy dx o 5 x 
Go eG Sui) (=>) 


and for the pressure 


é P=cp Aosina (:—=) k 
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This shows that the pressure changes in phase at the rate of the par- 
ticle vibrations in the wave. Am = Up is the amplitude of the veloc- 
ity of vibration. Thus, po, the amplitude of the pressure, is related 
to uo, the amplitude of the velocity, as follows: 

` 


Po = peuo- 


In acoustics, u is usually measured in cm/sec and p in dynes/cm?. 
Using these units, we obtain po = 4Auy for air at room temperature. 
The quantity pc is called the acoustic or wave resistance. The 
meaning of the designation is evident, namely, the greater the resis- 
tance the smaller the velocity of the vibrating particles for the same 


values of incremental pressure. 
The acoustic resistance of several materials is given in the table: 


| p c pe 

| (gm/em3) (cm/sec) (empemesce ) 
Glass 4 2.6 5.5 10 14 108 
Steel... .). = 7.9 5x 105 40 108 
Wood 0.7 4.2108 2.9 105 
Water 1 1.44X 105 1.4x105 


35. Energy Flux 


Wave motion transfers energy from one location in space to anoth- 
er. However, it should be kept in mind that every particle of the 
medium is involved in the transmission of energy and each con- 
tinuously vibrates about an invariable equilibrium position. 

Since all the particles of a body are involved in the vibration, 
the vibrational energy of a unit volume is 


Pümnax 
w= > 


where p is the density, i:e., the mass of a unit volume, and Vmax is 
the amplitude of the velocity of vibration. Substituting for the 


-latter quantity the familiar expression 
Umax = oA, 


acement amplitude and © is the frequency, we 


where A is the displ ( 
he vibrational energy of a body in the form 


can write the density of t 
pw2A? 


w = 


ae 
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This energy propagates with a velocity c. The following question 
now arises: What is the expression for the wave intensity, i.e., the 
amount of energy passing in unit time through a unit area perpen- 
dicular to the direction of propagation of the wave? Instead of refer- 
ring to the intensity of a wave, however, it is more usual to speak 
about the vibrational energy flux, meaning thereby the energy 
passing per unit time (power) through a given area. This approach 
is completely analogous to the analysis of the flow of water in a pipe. 
In a unit time, a wave traverses a path c and transmits the energy 
contained in a cylinder of length c and unit cross-section. Since 
a unit volume contains the energy w, the energy in the above cylin- 
der is we. This is precisely the meaning of wave intensity: 


I=we. 


We see that wave intensity has the sense of energy flow through 
a unit area. This was first noted by. N. A. Umov in his theoretical 
work on energy motion in bodies. 

Until now, it has been assumed that the wave motion propagates 
along a straight line. Such an assumption is useful when investigat- 
ing disturbances travelling along rods, strings, air columns, etc. 


However, we are also interested in investigating cases involving 
three-dimensional wave motion. 


To describe a three-dimensional wave 
wave front moves. The wave front at a particular instant can be 
determined if all the points in space having the same phase of vibra- 
tion are given. Noting the successive positions of this constant-phase 
surface, i.e., the wave front, we obtain a clear picture of the nature 
of the wave motion. Wie Say 

Generally speaking, this surface m 
case, then, what is meant by the direc 
It is natural to underst 
wave front. 


If the medium is perfectly homogeneous and the wave emanates 
from some point in the medium, the wave front is spherical. Such 
a wave propagates along the radii from the centre. At large distances 
from the centre of radiation, large portions of the wave front will 
appear to be in a plane—within experimental accuracy. In this man- 
ner, the concept arises of a plane wave propagating in a direction 
normal to the wave front. If the wave radiator is of linear shape, 
a cylindrical wave propagating along the radii of the cylinder is 
generated. Various types of waves are shown in Fig. 58. 


Disregarding all energy losses that occur during the motion of 
a plane wave, we obtain that the quantity of energy passing through 
Successive positions of the constant-phase surface remains unchanged. 
Hence, the intensity of a plane wave will not change during 


» we must know how the 


ay have any shape. In this 
tion of the wave propagation? 
and this direction to mean the normal to the 
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the process of propagation. The situation is different, however, 
with regard to spherical and cylindrical waves. Since the constant- 
phase surfaces increase in area as the square of the distance for 
spherical waves, and linearly as the distance for cylindrical 
waves, the intensities of these waves are inversely proportional to 


Fig. 58 


the square of the distance and to the distance, respectively. Only 
in this manner can the law of conservation of energy be satisfied. 

The intensity of a wave is proportional to the density of the vibra- 
tional energy, which is proportional to the square of the amplitude. 
Hence, the amplitude of a spherical wave is inversely proportional 
to the distance from the radiating centre, and the amplitude of 


a cylindrical wave is inversely proportional to the square root of 


the distance from the linear radiator. Thus, y= cos @ (t — 2) 
for a spherical wave and y=“; cos © (¢ = “) for a cylindrical 


wave. Here, the distance r, just as previously 2, is measured along 
the direction of wave propagation. 


Let us assume that a source of vibrations having a frequency of 4 ke/s is 
placed under water. The energy flux J = 1 watt/em?. Let us calculate the 
displacement amplitude A of the water molecules, their acceleration B and 
the amplitude @A = uo of the vibrational velocity. 

From the formulas of previous articles, it follows that 

1 


2I 
A=— = x10 cm; B=o Ve x107 uy ; 
o pe pe 


2I 107 cm 


w=] "oR" geet 
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eye m 
For water, c = 1,450 oe and p= 1 g z - Hence, 


cm 
Aw1.9x%10-3em; BaT; x125, 


For the same energy flux and frequency of vibration, we obtain the follow- 
ing results in air, where c = 330 a and p= 1.293 x 107%: 


cm m 
= ; -~ B=43 x 105 = 43,00 ; 
A=0.11 cm; ~ B=43 x 10 S002 43,000 sent 
Panom 
Ug = 700 Se 
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In actuality, waves propagating in a medium (solid, liquid or 
gas) decrease in intensity considerably more rapidly than’ indicated 
by the inverse-square law. This is due to losses in mechanical energy, 
i.e., transformation of mechanical energy into heat. 

The relation expressing the decrease in intensity of some particu- 
lar radiation in passing through a medium may almost always be 
obtained by reasoning as follows (for any medium and any radia- 
tion): If a wave passes through a layer of thickness dz, the intensity 
loss should be proportional to the intensity of the incident wave 
and the thickness of the layer, i.e., dl = — pldz. 

This equation may be integrated. Assuming that the intensity 
is equal to Jo at the point z = 0 and to J at the point x, we obtain 
a relation that is valid for finite distances: 


+= eae N dz, i.e., [=I gee. 
To 3 


Thus, the intensity of the wave decreases exponentially. 

In acoustics, it is convenient to use the concept of amplitude 
damping. Since the intensity is proportional to the square of the 
amplitude, the amplitude damping is expressed by a relation that 


differs from the above only in that the coefficient of dampińg (or 
absorption) is one-half of the value given there: 


1 
— sux 
A=Aje 2. f 
Let us examine the absorption coefficient u (ors n) somewhat 
closer. It is measured in reciprocal centimetres 


5 : (for the exponent 
must be a dimensionless quantity) and is equal 


to the reciprocal 


| 
| 
H 
| 
| 
| 
| 
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of the thickness for which the intensity or amplitude of the radia- 
tion decreases by the factor e. 

Naturally, the exponential damping relation is only a partial 
solution to the problem of the absorption of elastic waves by a medi- 
um. The determination of the dependence of the absorption coeffi- 
cient on the properties of the medium and the radiation frequency 
is a more formidable aspect of the problem. S 

Tt has been found that for many materials the damping of an 
elastic wave (most of the available data being for sound waves in 
air) increases with the frequency of the vibration. The relation for 
the absorption coefficient has been determined to be 


b= ao. 


For air, a = 4 X 10-7! sec*/em. Thus, over a distance of 1 km, 
a plane wave having a frequency of 100 cps decreases by a factor 
of ~1.015, while a very high audio frequency of 20,000 cps decreases 
by a factor of 1074! Ultrasonic vibrations are damped so rapidly 
- that their transmission over a distance of more than several hundred 
metres is completely impractical. 

However, the monotonic relationship between absorption and 
frequency is not always satisfied. Some materials exhibit selective 
absorption of sound in a relatively narrow frequency band. Thus, 
the absorption of ultrasonic waves by carbon dioxide is a maximum 
at frequencies near 277 ke/s. The parabola calculated in accordance 
with the formula u = aw? closely matches the experimental data 
in all regions except in the band indicated above. At frequencies 
close to 277 ke/s, the absorption is about 20 times greater than that 
calculated assuming a parabolic relationship. 

The dependence of the absorption coefficient on the properties of 
the medium can be expressed as follows for longitudinal waves in 
gases and liquids: the absorption coefficient is inversely proportional 
to the cube of the velocity of the elastic wave and directly propor- 
tional to the kinematic viscosity. As a result of this strong depend- 
ence on the velocity of propagation, and the fact that the kinematic 
viscosity of air is large, the absorption of sonic and ultrasonic waves 
in a liquid is about 1/1,000 of that in air. This means that for the 
same frequency elastic waves will propagate 1,000 times further 
in water than in air. z aN 

The absorption of transverse waves in solid bodies is also strongly 
dependent on the properties of the body. Thus, the absorption in 
rubber, cork and glass is, respectively, 13,000, 8,500 and 130 times 
greater than in aluminium. 

Due to their complexity, we S 
wave absorption in bodies. 


hall not go into the theories of elastic 
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37. Interference of Waves 


If there are several sources of waves instead of just one, then each 
point of the medium is simultaneously subject to several wave 
motions. It turns out that it is always possible to consider the vibra- 
tion of a physical quantity due to the action of several waves as the 
sum of the vibrations that would occur if each wave acted independ- 
ently. 

Let us assume that spherical waves emanate from two points 
located at a certain distance from each other. By means of the wave 
equation, we can determine the value of the vibration amplitude 
at any instant of time for any neighbouring point. If the point in 
question is located at a distance r, from the first source of waves 

_ and at a distance rg from the second source, the vibration there is 
represented by the formula 


y = Acos 2x (w—) + Acos 2x (w—2) A 


The result obtained by adding two vibrations differing only in phase 
is, as we know, also a harmonic vibration whose amplitude is 


ô A . y 
2A cos- , Which can be seen to depend on the phase difference be- 


tween the component vibrations. 


The phase difference 6 is equal 
in this case to ‘ 


~~ 
Hay 


Thus, generally speaking, all points of the wave field under con- 
sideration will be in vibration. However, the amplitudes of these 
vibrations will be different at different points. Two extreme cases 
deserve attention. First, let us consider the points at which the 


component vibrations annul each other. These points satisfy the 
condition’ 


rg 


nA = (2k 4-1) a, 


where k = Oa TE Sale er. i.e., the phase difference is equal to an 
odd multiple of x. On the other hand, if 


RET 
2a = 2 — kx, 


i.e., if the phase difference is equal to an even multiple of mt, the 
amplitudes of the vibrations will add arithmetically. Thus, in this 
case, the amplitudes reinforce each other to a maximum degree. 

The difference r; — rz may be called the path difference of the 
waves and this term needs no further explanation. The conditions 
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of maximum and minimum amplitudes may be formulated some- 

what differently by means of this concept. The maximum condition — 
ry —To= kh 

states that the path difference between waves arriving at a given 

point must be equal to an integral number of wavelengths. The 

minimum condition 


ry—m= 4 (2k-+4) 


states that the path difference must be equal to an odd number of 
half-wavelengths. These conditions are very easily visualised as 
follows: The waves rein- 
force each other when one 
crest is superimposed on 
another, and annul each 
other when a crest is su- 
perimposed on a trough 
or node. 

The superposition of 
waves, i.e., the addition 
of their amplitudes, leads 
to interference. 

From analytic geometry 
we know that a hyper- 
bola is a curved line sat- 
isfying the condition that 
the difference of the dis- Fig. 59 
tances from any point on 
the curve to two foci is a constant. If we pass a plane through 
the point sources and note in the diagram points of maximum 
reinforcement and those of wave annulment, the points will fall on 
hyperbolas. The corresponding curves are shown in Fig. 59. Such 
a picture can be easily observed on the surface of water when two 
sources sending out ripples from neighbouring points set up an 
interference pattern. 

We can use the above method to analyse the interference of any 


number of wave sources. 


38. Principle of Huygens-Fresnel. 
Reflection and Refraction of Waves 


The complete equality of all the vibrating points of a wave field 
is striking. They differ only with respect to phase. As a result, it 
is natural for the following idea to emerge: It should be possible 
to consider any point of a wave field as an independent source of 


spherical waves. 
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7 The validity of this idea first formulated in 1690 by Christian 
Huygens, may be tested by attempting to construct a wave front 
from data on a wave field on a boundary surface. It is necessary to 
take into account that there will be interference between the indi- 
vidual spherical waves (also called elementary waves or wavelets). 
Huygens’ principle, supplemented by Fresnel, shows that such 
a construction is possible. 

What is the significance of this principle? Let us assume that the 
wave falls on an opaque screen having several apertures. By means 
of the principle of Huygens-Fresnel, we can map the wave field 
beyond the screen without knowing anything about the sources of 


Fig. 60 


the field. It is sufficient to know the intensity of the field in the 
plane of the screen and assume that a spherical wave propagates 
from each point on the screen. The amplitude of the wave at any 
location in space is determined by adding all the wavelets coming 
from the apertures in the screen. ` 

Postponing consideration of the problems related to the passage 
of waves through a screen (problems which are mainly of interest 
in connection with light waves), we shall now apply the principle 
of Huygens-Fresnel to the explanation of the phenomena of wave 
reflection and refraction. 

Let us consider a portion of a plane wave incident on the bounda- 
ry between two media. As is well known, a wave of any origin is 
reflected at an angle equal to the angle of incidence. But why should 
this occur? Huygens’ principle gives the explanation. Every point 
on the boundary between the media may be considered as a wavelet 
source. The first wavelet emanates from the point first reached by 
the incident wave. Successive points on the boundary will 
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then be excited, the last point to start vibrating being the one 
last reached by the incident wave. Fig. 60 shows the positions 
of the wavelets for the instant of time when the incident 
wave reaches the last point. The wave front generated by the wave- 
lets forms an angle with 
the boundary equal to that 
of the incident wave. 
Thus, the propagation ve- 
locities of the incident 
wave and the reflected 
wave are the same. This 
means that the radius of 
the largest sphere must be 
equal to the path tray- 
ersed by the incident wave 
from the instant the first 
point was excited to the 
instant the last point was 
excited. 

In exactly the same 
manner, the wave front of 
a reflected spherical wave can be easily constructed. This construc- 
tion is shown in Fig. 61. In Fig. 62, a photograph is shown of 

a sound wave reflected by a 

wall. 

Let us now consider wave- 
lets penetrating into the sec- 
ond medium and generating 
arefracted wave front (Fig. 63). 
The different media have dif- 

_ ferent densities and elastic 
: properties. Hence, the wave 
e propagation velocity also dif- 

A -= fers for each medium. Let us 

ie . now perform the same con- 

OAA 4 struction as for reflection, i.e., 

draw wavelets on the diagram 

Fig. 62 for the instant when the inci- 

dent wave reaches the last 

point. The wave front is rotated due to the difference in propagation 
velocities. If the wave has penetrated into a denser medium, the 
radius of the largest wavelet should be less than the path traversed 
by the incident wave from the instant of excitation of the first point 
on the boundary to the instant of excitation of the last point. More- 
over, the ratio of these lengths should be equal to the ratio of the 


) 
| 
J Í 
: i 
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wave propagation velocities. On the other hand, as can be seen from 
Fig. 63, the ratio of the above distances is equal to the ratio of the 
sine of the incident angle and the sine of the refraction angle. Thus, 
we arrive at the well-known relation for wave refraction: 


sin @ c 


sinĝ 


GI 
t 


The direction of the propagation will be deflected toward the normal 
to the boundary if the wave moves into a medium of greater density 
and, on the other hand, if it 
moves into a medium of lesser 
density, it will be deflected 
away from the normal. The 


. c . 
ratio aan is known as the 


2 
refractive index. 


39. Reflection Coefficient 


The explanation of the ge- 
ometry of reflection and refrac- 
Fig. 63 tion may appear to be a 
somewhat uninteresting appli- 
cation of the theory. However, the wave theory enables us to do 
considerably more, namely, ascertain the relative proportions of the 
reflected and refracted waves as functions of the properties of the 
media whose interface is being considered. In order to simplify the 
calculations, we shall limit ourselves to the simple case of the 
normal incidence of a longitudinal wave on the interface of two 
media. The nature of the proof, however, is the same for all 
conceivable cases. 

In a discussion of this type, the following is axiomatic. At the 
boundary between two media, neither u; the velocity of the vibra- 
tions of particles, nor p, the incremental pressure, can change abrupt- 
ly. It is intuitively evident that it cannot be otherwise, but this 
can be shown rigorously on the basis of fundamental laws of physics. 

On one side of the boundary, there are waves having the instan- 
taneous velocities Uincia and U;esecr. On the other side of the bound- 
ary, there is also the wave haying the instantaneous velocity Urefract- 
The continuity of velocity yields the condition: Uineia + Urefiect = 
= Urefract; the continuity of pressure yields: Uincid P1C1 F Ureftect P1C1= 
= Uryefract P2C2- However, examining these two equations, we see 
that they are incompatible since 01C1 =Æ PoCo. How is this to be 
explained? The answer is that we have forgotten that the instanta- 
neous values of the velocities and pressures are vector quantities 
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and that even in the simple case when the displacement vectors are 
in a single plane the amplitudes may differ in sign. It can be seen 
that the equations become compatible only if the amplitudes of the 
velocity and pressure for the reflected waves are given opposite 
signs. The equations of continuity are then written in the form 


Uineid + Ureflect = Urejracts (Uincia = Ureftect) p101 = Ure fractP2Co 
or 
Uincid — Urejtect = Urejract} (Uincid +Urefiect) P161 = Urefractp2C2. 


We leave it to the reader to show that all other combinations of 
signs will fail to make the equations compatible. 

Since the amplitudes are positive quantities, the sum must be 
greater than the difference. Hence, the first pair of equations is 
valid when pyc; > psc and the second pair is valid for the reverse 
case. The first pair of equations arises when all the vibration veloc- 
ity amplitudes are in one direction and the phase of the reflected 
pressure wave differs by 180°, i.e., the amplitude of the reflected 
wave is oppositely directed with respect to the incident and refract- 
ed waves. The second pair corresponds to the reverse case. 


Pcl > P2ce2 O11 < pace 
Velocity wave Pressure waye Velocity wave Pressure wave 
incident > incident > incident > incident —> 
reflected —— reflected <— reflected <— reflected > 
refracted > refracted > refracted refracted > 


This interesting phenomenon of amplitude vector reversal im 
reflection may be described as a one-half wavelength loss or a 180° 
phase jump. Thus, the change of sign in the wave equation 
y = A cos o (¢ = = , where y is any physical quantity, may be 
obtained by introducing a 180° phase shift in the argument of the: 
cosine. On the other hand, a 180° shift in phase is equivalent to. 
displacing the wave distribution by one-half wavelength. 

Thus, at the interface of two media, the incident and reflected 
Waves act either to reinforce each other or to annul each other to the- 
Maximum extent possible. : f 

It should be recalled that in reflection a one-half wavelength loss: 
occurs for the vibration velocity wave when it enters a medium of 
greater resistance (sometimes inaccurately expressed as a medium 
of greater density). The displacement wave is inseparably linked 
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with the vibration velocity wave and also suffers a one-half wave- 
length loss. l 

In entering the second medium the wave does not execute a phase 
jump. i i , 

Solving the above equations simultaneously, we obtain the expres- 


g P 3 Urejlect 
sion for the reflection coefficient r, i.e., 


Uincident ` 
Thus 
pe Piha = poez 
Ti ni > 
Pici- Pate 
Fi os 8 s z Urefract 
where r isalways>0. Similarly, the refractive index g, i.e., ——— , 
a 5 Ujincident 
1s —_ 2 
ANGEN . . 
The wave resistance in air is much different from that in solid 
bodies. As indicated above, pc = 41 for air, while for steel (9 = 
= 7.9 gm/em® and c = 5,000 m/sec) pc = 40 x 410°. Thus, r = 
= 0.99999. This means that sound incident from air on steel is 
practically completely reflected and in effect does not penetrate 


the latter. It can be easily calculated that at the boundary between 
air and water r = 0.9997. 


40. The Doppler Effect 


Until now it has been assumed that the wave source and the receiv- 
er (i.e., the observer) were both stationary with respect to the 
medium in which the wave was propagating. Various effects, which 
were first noted by Doppler (1842), occur when the source or the 
observer, or of course both, move with respect to the medium. They 
consist basically in the fact that when the wave source moves the 
observer measures the vibration frequency v’ and when the observer 
moves he measures the vibration frequency v”. These frequencies 
differ from each other and from the frequency v that is measured 
when the observer and the source are stationary. 

In considering the Doppler effect, it is necessary in the first 
place to note that the wave leaving the source propagates entirely 
independently of the motion of the source and the observer. There- 
fore, in moving relative to the medium, the source or the observer 
may approach or recede from the moving wave. 

Why does such motion lead to the measurement of a frequency 
that differs from its “real” value? This is because the observer deter- 
mines the vibration frequency as the number of waves entering his 


apparatus per unit time. On the other hand, the formula v=— 


z A 
gives the number of waves emitted per unit time. If the observer 


moves toward the source with the velocity u, then in 1 sec the num- 
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ber of waves that he measures is not v, but a number larger than 
this. Moreover, the ratio of c -+ u, the relative velocity of the wave 
and the observer, to c is the factor by which the measured value is 


larger. Thus, 
v’ pu u 
= ; v=v(t+—). 


y. c 
If the source moves toward the receiver, the observer will again 
register a larger number of waves than when the source and the 
receiver are stationary. However, in this case, the reason for the 
increase is different. 

This is not evident at first glance. The motion of a source having 
a fixed frequency of vibration leads to a change in the distances 
between points of equal phase of the wave. If the first case is con- 
sidered to be crudely analogous to the motion of an observer toward 
a column of athletes running at equal velocities and maintaining 
the constant distance À between them, then the second case clearly 
requires a different interpretation. This case can be visualised as the 
slow displacement of the line of start. At equal intervals of time, 
the runners jump from an automobile moving along the track, which 
leads to a change in the distances between them. This distance 
becomes A’, not 2’. If the line of start (the source) is displaced in 
the direction of the observer and v runners start each second, then 
in 1 sec they are distributed over a distance given by c — u. Thus, 
the interval between runners (wavelengths) is v=, The fre- 
quency at which the runners moving with velocity c cross the finish 
line, i.e., the vibration frequency perceived by the observer, is 


” 


Ch ” 
y =F NEEN: 3 


Both of the above formulas are also valid when the source and the 
observer are moving apart. In this case, it is merely necessary to 
reverse the sign of u. 

Thus, it has been shown that when the source and the observer 
approach each other the measured frequency of vibrations radiated 
by the source increases. When the source and the observer move 
apart the frequency decreases. 

A well-known example of the Doppler effect is the change in sound 
of the whistle of a locomotive as it passes an observer. When the 
locomotive approaches the observer the frequency of the sound is 
higher than the real frequency. The pitch changes abruptly when the 
locomotive sweeps past him. In receding, the frequency of the sound 
perceived is lower than the real frequency. For a train moving at 


ʻa velocity of 70 km/hr the jump amounts to ~12 per cent of the 


real frequency. 
9-1409 


CHAPTER VII 
STANDING WAVES 


41. Superposition of Two Waves Travelling 
in Opposite Directions 


Let us assume that two plane waves haying exactly the same 
characteristics are moving in opposite directions. We are interested 
in the resultant vibrational motion of the medium in which the 
waves are propagating. 

As indicated above, a difference in the direction of propagation 
is taken into account by a difference in the coordinate signs in the 
wave equation. The resultant displacement should, therefore, be 
given by the expression 


y=Acos@ (2 —=) +A coso (¢ +=) =2A cos cos ot = 


Qt. 
= 2A cos a cos ot. 


This result is very interesting, for the sum of two travelling waves 
has not yielded wave motion. The formula obtained indicates the 
presence of vibrations of amplitude 2A cos 2t , Whose numerical 
value depends on the location in space. We call this peculiar vibra- 
tional state of the medium a standing wave, which arises whenever 
two identical travelling waves move in opposite directions. It should 
be emphasised that a standing wave is not a wave in the usual 
sense. A travelling wave transfers energy from one point to another, 
but this is in no way true of a standing wave. A travelling wave 
can move to the right or to the left, but a standing wave has no 
direction of propagation. The adopted designation merely character- 
ises the vibrational state of the medium. 

What are the characteristic features of this vibrational state? 
In the first place, we see that not all the particles of the medium 


vibrate. At the points in space satisfying the condition pea 


i? 
3A 5A nabrati i i 7 

DE oh the vibration amplitude is equal to zero. These points 
are known as the nodes of the standing waye. The distance between 
fwo adjacent nodes along the z-axis, the direction of propagation 
of the travelling waves, is equal to. one-half wavelength. Between 
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every two nodes is a point that vibrates with a maximum amplitude 
of 2A. Such points are called the antinodes of the standing wave. 

Fig. 64 shows the vibrational state corresponding to the standing 
wave at several successive instants of time. We see that the adopted 
designation is fully justified. At each instant a wave can be seen, 
but the wave does not move. A series of consecutive snapshots will 
show that the points of intersection of the wave with the abscissa, 
i.e., the nodes, remain fixed. The 
wave is stationary and the only 
change occurring between snapshots 
is a change in the magnitude of 
the displacements. At a certain in- 
stant, all points of the medium are 
motionless. After this instant of 
time, points that previously diverged 
upwards will now diverge down- 
wards, and vice versa. It is evident 
that this picture has nothing in com- 
mon with the travelling wave shown 
in Fig. 57, where two comparable 
“snapshots” are depicted. There the 
wave is seen to move. At each suc- 
cessive instant, the maxima and min- 
ima of the wave pass to new loca- 
tions. 

We stated that no energy transfer 
occurs in a standing wave. Then how 
can we describe, in terms of energy, 
the processes occurring in this pe- 
culiar vibrational motion? Clearly, 
the energy of a standing wave (for 
some region in which it exists) is 
a constant quantity. At the instant 
when all the particles are passing through the equilibrium, position, 
ail the energy of the vibrating particles is kinetic. On the other 
hand, at the position of maximum deviation of the particles from 
the equilibrium position, the energy of all the particles of the body is 
potential. 

A standing wave is a very important vibrational process. Stand- 
ing waves of different types arise in bodies of limited dimensions 
through which elastic waves are propagated. This is because elastic 
waves are reflected back from the boundary into the body. A com- 
plex vibratory state arises in the finite body which is the result of 
the superposition on the source wave of all the waves that were 
reflected from the walls. Several typical cases will now be considered. 


g* 


Fig. 64 


132 Standing Waves 


42, Free Vibrations of a Rod 


By means of a blow or otier means, it is possible in any solid 
rod vto excite a longitudinal wave that propagates along the length 
of the rod. From the opposite end of the rod, this wave is reflected 
and in this manner the entire rod is put into a vibratory state re- 
presented by a standing wave. The state will be one of free vibrations 
since it arises as the result of an impulse of short duration and con- 
tinues without the action of external forces. We can predict the 
behaviour of these free vibrations if we know the length of the rod 
and how it is fixed. These data are known as the boundary condi- 
tions. It will be found that a node of the standing wave is located 
at the point where the rod is fixed and an antinode of the standing 
wave is located at the free end. 

We shall now consider several modes of excitation of free, longi- 
tudinal vibrations in a rod of length L. 

A Rod Fixed at Both Ends. In this case, nodes of the displacement 
wave are formed at the ends of the rod. Since the distance between 
nodes is equal to one-half wavelength, the wavelengths that are 
possible, in terms of the length of the rod, are given by the condition 


7 2L ù 
L=nz,ie., Mn =F where n is any integral number. 


Using for the velocity of an elastic wave the expression c= Ma A 


f 
and recalling the relationship between frequency and y yelength, 
we obtain the expression for the natural frequencies of the free 
longitudinal vibrations of the rod: 


map Y= 
TS. 2D pri 


The qualitatively new content of this result should be noted. 
A solid body does not have one, but a multiplicity of natural (char- 
acteristic) frequencies of vibration. Hence, a rod can execute 
a variety of free vibrations. It is also possible for a rod to perform 
nonharmonie vibrations having any arbitrary spectrum * consisting 
of the frequencies v,. 


‘The frequency v; is the fundamental frequency of vibration of the 
rod. It corresponds to the vibratory motion for the condition Z =~ a 
This means that for the fundamental vibration an antinode of the 
standing wave is at the centre of the rod and there are no nodes be- 


* The word “spectrum” is used quite often in physics to denote a set of par- 
ticles having different velocities, masses, etc., or a set of waves having differ- 
ent wavelengths (frequencies), etc. 
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tween the ends. The vibrations of the second overtone (second har- 
monic) correspond to the condition L = à. Now, there is a node 
at the centre of the rod. If the third harmonic is excited, there will 
be two nodes between the ends of the rods, ete. 


Example. For a steel rod (p = 7.7 gm/cm? and E = 24,000 kg/mm?) whose 
length is 7 metres, the fundamental frequency is vy = 365 cps. 


A Rod Free at Both Ends. If a rod is suspended by a thin string 
and then vibrations excited in it, the resultant standing wave must 
satisfy the condition that antinodes are located at both ends of the 
rod. Just as in the previous case, the connection between the length 


of the rod and the wavelength is expressed by the relation: L = n& 4 


Hence, the formula for the natural frequencies will also be the same. 

The difference between this case and the previous one is in the 
distribution of the nodes and antinodes. For the fundamental vibra- 
tion, the centre of the rod is at rest (node). If the second harmonic 
is excited, there will be an antinode at the centre; one-quarter wave- 
lengths away—nodes; and at the ends—antinodes. 

A Rod Fixed at One End. In this case, there will be a node at the 
fixed end and an antinode at the other end. For the fundamental 
vibration, the rod has a form corresponding to one-quarter of a peri- 
od of a sinusoid. Since the distance between a node and an antinode 
is equal to + the relationship between the wavelengths and the 
length of the rod is given by the condition 


A 
L=n7 > where n=1, 3, 5,... 


The natural frequencies of the vibrations of such a rod are given 
by the formula 


In the first two cases, the frequencies are related to each other as 


the whole numbers. Here, they are related-to each other as the odd 


numbers. i 
A rod fixed at the centre will have a node at the fixed point and 


antinodes at the ends. The problem is essentially the same as above. 

The boundary conditions used in the consideration of the vibra- 
tory state of a rod are an extreme case of the boundary conditions 
for reflected waves, considered on p. 427. As was explained earlier, 
reflection from a boundary separating one medium from another 
medium of greater resistance is accompanied by a loss of one-half 
wavelength in the displacement wave. If the rod is fixed, the wave 
does not penetrate the second medium at all. In this case, the second 
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medium can be said to have an infinitely large resistance. The coef- 
ficient of reflection is equal to unity and the reflection is accompa- 
nied by a loss of one-half wavelength. It is not difficult to see that 
this corresponds to the presence of a node at the boundary between 
the two media. The reflection of the wave from the free end of the 
rod corresponds to reflection from a medium having zero resistance. 
A reflection coefficient equal to unity and the absence of a half 
wavelength loss leads us to conclude that an antinode must exist 
at such a boundary. 

Free longitudinal vibrations may also be excited in columns of 
liquid and columns of gas. 

Free lateral vibrations are easily excited in a string under ten- 
sion. The distribution of the nodes and antinodes will naturally be 
the same as for a rod fixed at both ends. The set of frequencies is 
expressed by a formula analogous to that derived for the rod, the 
only difference being that in the expression for the velocity of the 
lateral wave it is necessary to replace Æ by the tension, i.e., the 
force stretching the string divided by the cross-section. 


43. Free Vibrations of Two-Dimensional 
and Three-Dimensional Systems 


In rods, strings and air columns, the constant-phase surfaces of 
equal phase consist of parallel planes. The vibratory state may be 
conceived as the result of superimposing plane waves extending 
along a single line. However, more complex cases are possible. Thus, 
we can have vibratory motion encompassing a two-dimensional 
region, e.g., plates and membranes, or encompassing a body whose 
three dimensions are of equal order of magnitude. 

The vibration of elastic and rigid diaphragms is a two-dimensional 
problem. A plate fixed at its edges will have a different mode of 
vibration than a plate fixed at a single point or not fixed at all. 
Apart from the vibration of rigid plates, vibration of stretched 
nonrigid films, e.g., rubber and soap films, is also encountered. 

In principle, the general behaviour of the free vibrations in this 
case does nod differ from that already considered. Since this is a 
two-dimensional problem, the nodes and antinodes will now con- 
sist, in general, of curved lines. For example, the fundamental 
vibration of a circular plate fixed along its circumference has a sin- 
gle antinode (a point in this case) at the centre of the circle, i.e., 
the central point vibrates with maximum amplitude. As we move 
toward the edge, where a nodal circumference is located, the ampli- 
tude gradually decreases while maintaining circular symmetry. This 
is the simplest case, namely, vibration of the fundamental (lowest) 
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frequency. A membrane may be excited at a higher harmonic; in 
such a case the surface is broken up by nodal lines. It turns out that 
nodal lines in a circular plate may have a circular form or consist 
of diameters passing through the centre. 

The demonstration of nodal lines by the Chladni method (named 
after the scientist who proposed it) is an effective and simple expe- 
riment. A plate sprinkled with sand is put into a vibratory state by 


SS SS 


Fig. 65 


means of a blow or a fiddlestick. The sand rolls away from the anti- 
nodes and gathers along the nodal lines. Fig. 65 shows several Chlad- 
ni figures. $ 

The vibratory state of a solid three-dimensional body is, of course, 
the most complex. We shall avoid consideration of this phenomenon 
in a body of complex form and restrict our study of such free vibra- 
tions to a right-angled parallelepiped. If the standing waves 1n 
such a body were only due to the superposition of waves travelling 
Parallel to an edge of the parallelepiped, the natural frequencies 
of the vibrations would be limited to the values 

cH) u URRY 5 aS 
in Dla ee 

yeas Ny, Ng, ng are arbitrary wie sae: and li, l2, la are the 
ength es of the parallelepiped. ; 

Tee E propagating in the body may Tornapa e 
al, any angle with the boundaries. In this case, the stan ing way es 
are formed after a number of reflections, when the beam returns 
to the exact point from which it left. Such an gaatt nae 
may be conceived as the superposition of waves of aie eae 
cies travelling along different edges of the parallelepiped. ‘Their 


Natural frequencies of vibration are: 
nge 
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and the wave numbers are 


pee Bi 2 nz ; n3 
kı P? kə 2? ks aia 
In superimposing the waves, Æ is determined by addition, but it 
should be noted that the addition is vectorial. Thus, 


Spa 

5 c ni o n3 
o 
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It is evident that the frequencies of vibration for the simple cases 
of wave propagation parallel to the edges of the body are also obtain- 
able from this formula; in such a case only one of the three whole 
numbers in the formula is different from zero. 

The vibration spectrum of a three-dimensional body is depicted 
in Fig. 66 in three-dimensional space, which may be called frequency 


space or inverse space. Here, the quantities rA r A z oa are plolted, 


respectively, along the three axes. Each node in the lattice 
(inverse lattice) thus formed represents one of the natural fre- 
quencies of vibration of the body for the numbers ni, na, ny. The 


= VETTE 


Fig. 66 


radius vector drawn to a node of the lattice in the inverse space re- 
presents a possible vibration frequency. A sphere of radius v includes 
all points corresponding to frequencies less than v. The volume of 


7 4 
such a sphere is equal to zoe and the volume of each cell of the 


c 


inverse space is equal to (=) */v, where v is the volume of the body. 


Therefore, the number of free vibrations of a body with frequencies 
less than v (the number of nodes in an octant of the sphere) is ex- 
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pressed by the formula 
4 v3 
yuu xz: 
This interesting relationship shows that the number of natural 
frequencies increases sharply as the band of frequencies being con- 
sidered increases. For high frequencies, the discrete character of 
the spectrum becomes blurred, for the frequencies are very close to 


each other. 
44. Forced Vibrations of Rods and Plates 


If the vibration of a rod, plate or other body does not take place 
in vacuum but in some medium*, namely, liquid or gas, then 
a fraction of the intensity, depending on the ratio of the wave resist- 
ance of the contiguous media, is transferred from the vibrating 
body to the medium. This idea can briefly be expressed as follows:- 
a vibrating body radiates energy. Due to radiation, the free vibra- 
tions of a rod, string, etc., are rapidly attenuated. If it is required 
that the body be a constant source of radiation, the vibration must 
be excited by an external source. Just as in the case of particle 
vibrations, the energy may be provided either by means of self- 
sustained vibrations or by producing forced vibrations. 

Depending on the method of providing the external energy and 
on its point of application, one can excite, generally speaking, one 
or mote of the natural frequencies of a body capable of vibrating. 
One can, for example, produce forced vibrations in a string under 
tension in the following manner. An electromagnet fed by sinusoidal 
current from an audio generator is fixed about a steel wire. The vibra- 
tions of the wire under the action of the periodically varying exter- 
nal lateral force become perceptible only at resonance. By varying 
the tension of the wire and the external frequency, one can show 
that the wire will vibrate at the fundamental frequency as well 
as at various overtones. í y y 

The production of forced vibrations (standing waves) in piezo- 
electric plates and ferromagnetic rods is of great practical impor- 
tance. Such vibrating bodies are generators of ultrasonic waves- 

Ferromagnetic bodies may elongate or shorten under the action 
of a magnetic field. The theory of this phenomenon 1s complex and 
will be treated only briefly in this book. For the present, it is suf- 
ficient to illustrate how the length of a ferromagnetic rod depends 
on the intensity of the field. This is done in Fig. 67, which shows 
_ 


ec) » reconciled to the fact that yibration of a body is used 
in two Se oC of a body as a whole and vibration of the particles 


of a body with respect to one another. 
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that nickel and annealed cobalt shorten in fields of any intensity, 
cast cobalt shortens in weak fields but elongates in strong fields 
and, finally, iron elongates in weak fields and shortens in strong 
ones. In any case, a ferromagnetic rod can execute forced vibrations 
when placed in an alternating magnetic field. For this purpose, 
the rod is usually placed in the core of a transformer fed by alter- 
nating current. In order for the 
standing wave in the rod to be of 


Sl sufficient intensity, it is necessary 
Q Co (cast) a A 
2 to operate under conditions of res- 
= SSS ea onance, i.e., the frequency of the 
S -m alternating field should coincide 
RI with the rod’s natural frequency 
820 of vibration. 
Ss . . 
W Since the rod is fixed at the 
Ñ centre, the natural frequency of 
È -40 vibrations is 

1000 2000 x 
intensity of magnetic field in oersteds v= ae 

Fig. 67 and the rod can vibrate only at 


j rae the frequencies of the odd harmon- 
ics. Substituting the numerical values of the physical constants, 
the fundamental frequency for nickel turns out to be equal to 


2 
yal ke/s (where Z is in centimetres). 


Thus, a rod having a length of 40 cm will vibrate at a fundamental 
frequency of 6 ke/s. 

A piezoelectric crystal is most commonly used as a source of ultra- 
sonic vibrations. ` 


45. Piezoelectric Vibrations 


Any crystal that does not have a centre of symmetry in a number 
of its elements of symmetry (see Sec. 262) may exhibit the piezo- 
electric effect. This phenomenon manifests itself in a change of the 
dimensions of a crystal under the action of an electric field and, 
conversely, in the creation of an electric field in a crystal under the 
action of forces applied to the crystal. When utilising the piezo- 
electric effect as a source of vibrations, we are dealing, of course, 
with the former aspect of the phenomenon, known also as electro- 
striction or the inverse piezoelectric effect. Piezoelectric materials 
include quartz crystals, Rochelle salt, barium titanate, and 
dihydrophosphate of ammonia. Generally speaking, there are hun- 
dreds of known materials that could, in principle, be used for this 
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purpose. However, additional requirements, e.g., durability and 
stability with respect to moisture as well as the natural desire to 
select crystals that will yield the strongest effect, sharply limit the 
practical choice of material. 

The change in crystal dimensions under the action of an electric 
field differs for different directions (with respect to the crystal’s 
axes of symmetry). Therefore, different deformations will be obtained 
when, from a crystal, we cut rods or plates having different 
Orientations with respect to the crystal’s axes and place them between 
condenser plates. Usually, the quartz plate or other piezo- 
electric material is cut in such a manner that longitudinal displace- 
ments occur in the material when it is placed in an electric field. 
Thus, under the action of an alternating electric field, the forced 
vibrations produce standing longitudinal waves. 

If Z is the thickness of the plate in the direction of wave motion, 
then, as usual, the natural frequency of vibration is given by the 
formula v = For a quartz crystal having this simple orientation, 
the velocity of the elastic wave is equal to 5,400 m/sec. Hence, the 
fundamental natural frequency of vibration of a quartz plate is 
determined from the formula 


» 2100 ke/s (J is in centimetres). 


It should be noted that the measured value is somewhat different, 
namely, 2,880/1 ke/s. y à 

a a amplitudes depend on the magnitude of the applied 
field, whereby a linear dependence exists between the displacement 
magnitude and the electric field intensity. It is not uncommon to 
use large field intensities. Since quartz is an excellent insulator, 
electric fields of the order of 30,000 volts/em find application for 
thicknesses up to a centimetre. . : ; 

To attain a powerful ultrasonic signal, use is made of the reso- 
Nance effect. This is essential because resonance displacements are 
thousands of times greater than displacements under the action of 
Static fields, and furthermore, the vibration energy is proportional 
to the square of the displacement. ; s 

By gradually increasing the frequency of the gener ator, one can 
Successively excite all the overtones of the crystal. The frequency 
range of commercial ultrasonic generators extends from hundreds 


to thousands of kilocycles. 


CHAPTER VIII 


ACOUSTICS 


46. The Objective and Subjective Nature of Sound 


Man can perceive the loudness, pitch and timbre of sound by 
means of his hearing organs. The electronic oscilloscope enables us 
to investigate the objective and subjective nature of sound in detail. 

Since sound is the result of a vibratory process taking place in 
air, it may be completely described by a curve showing amplitude 
change (immaterial whether displacement, vibrational velocity or 
pressure) with respect to time. Such a curve enables us to establish 


Fig. 68 


whether the process is periodic and, if so, to determine the funda- 
mental tone of the vibration. By studying the periodicity, the over- 
tones present and their amplitudes may be determined. In other 
words, the curve showing the dependence of the vibration on time 
always enables us to find the spectrum of the vibration, i.e., to 
establish which frequencies are present and the amplitudes of these 
frequencies in the spectrum, The curve is obtained by means of 
a microphone connected to an oscilloscope. In more elaborate ar- 
rangements, the curve of the vibration is automatically converted 
into its spectrum. 

A simplified diagram of such an analyser is shown in.Fig, 68. 
The input sound is converted by a microphone into electrical 
current, is amplified and applied to an apparatus consisting of 
a large number of filters, each of which Passes a specific band of 
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frequencies, e.g., + of an octave (36-48, 48-60, 60-72 cps, etc.). 


The filters resolve the signal into its spectrum; the narrower the 
frequency band of each filter the greater its resolving power. Each 
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Portion of the spectrum passing through a filter is fed via a commuta- 
tor to an amplifier and detector (rectifier). The output of the detec- 
lor is applied to the oscilloscope plates that deflect the electron beam 
in the vertical direction. If a voltage is not applied to the second 
Pair of oscilloscope plates, then, upon connecting each of the filters, 
the vertical deflection of the electron beam will be proportional 
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to the amplitude of the corresponding frequency component of the 
spectrum. However, the arrangement is greatly improved by con- 
necting a voltage to the second pair of plates that provides hori- 
zontal sweep of the electron beam. This is done by properly syn- 
chronising the rotation of commutator 2, which provides the 
sweep voltage, with the automatic rotation of commutator 4. Thus, 
the amplitude of the component passed by each filter is displayed 
for a different, but unique, horizontal displacement of the electron 
beam. As a result, the complete spectrum is displayed on the oscil- 
loscope screen. 

Periodic vibrations have discrete spectra, while nonperiodic 
vibrations have continuous spectra. Musical sounds are illustrative 
of the former, while various kinds of noise illustrate the latter. 

One and the same musical tone played on different instruments 
will have the same fundamental frequency, but will have different 
spectra. The quality of the sound is determined by the distribution 
of the overtone intensities (see Fig. 69). In the musical sense, the 
more complex the spectrum the richer the quality of the sound. 
It is interesting that the phase of the overtones (see the formula 
on p. 105) does not affect the subjective perception of sound. The ear 
can only distinguish between the intensities of the overtones. 

Noise analysis is of great practical importance. If the noise fre- 
quencies of largest intensity are known, the cause of the noise is 
- more easily determined and, consequently, more easily eliminated. 


47. Intensity and Loudness of Sound 


In Fig. 70, the heavy lines mark the limits of the region of audito- 
ry perception for the average person. Two uniquely related quanti- 
ties are plotted along the ordinate, namely, the amplitude of sound 
pressure and the intensity of sound. The sound pressure p and the 
sound intensity J are related, in the simplest case, by the formula 


We know that the intensity of the wave is 
I=we, 


where w is the energy density, i.e., w Ze Pease. 16 = (see p. 117). 


Hence, substituting, we obtain the above formula. The intensity of 
sound may be measured in walts/cm?. 

Very intense sounds produced by a pressure of about 2,000 bars 
cause a sensation of pain. Very weak sounds can still be perceived 
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by the average person when they have a pressure of 2 x 10-4 bar 
(1 bar = 1 dyne/cm?). Since for air pc = 41, we obtain for the limits 
of sound intensity the values 0.5 x 105 ergs/sec cm? (=0.5 x 10-2 
watts/em?) and 0.5 x 10-18 watts/em?. ` 
This large range of intensities makes it convenient to introduce 
a logarithmic scale. If the intensity of one sound is J, and the inten- 
sity of another 7%, we say that Js is K decibels louder than J, when 


K=10log—2 


ST fe 


The quantity K is called the loudness level. Thus, if the sound 
intensities differ by a factor of a million, they differ in loudness 
by 60 decibels. 

When expressing the sound intensity in decibels, it is necessary 
to indicate the zero level. The yalue usually selected for this level 
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is close to the threshold of audibility (1071 watts/cm?). A whisper, 
then, has a loudness of about 15 db and the noise of an airplane 
about 120 db. Bat 2 a A 

Returning to the diagram of auditory perception, we see that the 
region of Soth is nite restricted in frequency (100 to 10,000 eps) 
as well as in intensity (40 to 80 db). Sounds of different frequency 
have different audibility. The human ear perceives frequencies of 
Several thousand cycles per second best of all. Below 20 eps lies 
the infrasonic region and above 10,000-20,000 cps the ultrasonic. 

The table gives approximate values for the sound pressure p, the 


intensity Z and the loudness K. 
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Pp I K 
(bars) (watts/cm2) (db) 
Threshold of audibility ...... 2.9% 10-4 10-16 0 
Drip of thawing snow... ... . 2.9 10-3 10-14 20 
Low conversation at a ‘distance of 
S CULCS MEO M Same ala 0°), vm 2.9% 10-2 10712 40 
Symphonic orchestra (fortissimo) 2.9 10-8 80 
Airplane engine at a distance of 
SEES N Rant. cle hag AMD 290 10-4 120 


48. Architecture and Acoustics 


In some auditoriums speech is unintelligible even though suffi- 
ciently loud, while in others the speaker must raise his voice in 
order to be heard. Let us investigate the physical parameters of an 
auditorium that determine its acoustic properties. 

Experiments show that the most important factor. of this nature 
is the so-called reverberation time, the time in which a sound decreases 
to one-millionth of its original intensity. With respect to acous- 
tics, an auditorium is best when its reverberation time t is 0.5- 
1.5 sec. If t is less than 3 sec, the auditorium is considered to be 
good. If the reverberation time exceeds 5 sec, the acoustics are very 
bad, being characterised by “resounding”. 

A sound uttered at some part of a large hall is reflected- from the 
walls, floor and ceiling, the furniture and drapes, and from the 
clothes of those present. If for each reflection the sound loses a large 
part of its energy, the sound will be attenuated very rapidly. The 
reverberation time in this case is very small and the sound will be 
“dull”. Resounding occurs when the sound is repeatedly reflected 
with little attenuation. The listener will perceive the direct wave, 
the wave after one reflection, two reflections, etc. If the interval 
of time between the arrival of these sound waves does not exceed 
1/15} of a second, the ear will not perceive two or three distinct 
sounds as in the case of echoes, but rather a prolonged, and hence 
unclear, sound. 

~ It is evident that the time attenuation of sound is determined 
by its absorption in the surrounding bodies. Since the sound is 
repeatedly reflected, after a short time of constant sounding from 
some source the auditorium will more or less uniformly fill up with 
sonic, i.e., vibratory, energy. Within a short period of time, equi- 
librium is established between the energy delivered by the source 
and the energy absorbed by the medium. It should be noted, inci- 
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dentally, that in the absence of absorption the sonic energy in a 
closed room would increase without limit for continuous sounding 
of a source. K 

If the sound source is interrupted, then the phenomenon reduces 
to the absorption of the sonic energy by the surface of the bodies 
located in the room. Each of the materials involved in this process 
has its own characteristic coefficient of absorption a. If there is an 
open window in the room, the absorption coefficient may be as- 
sumed to be equal to 1, since the sound completely leaves the room, 
and this is equivalent to being absorbed. For a smooth, solid wall 
the coefficient a is almost zero (for concrete, it is 0.015). Now, the 
sound absorption for the entire room may be described by the ex- 
pression A = aS, + G@oSo + 4353+ >- -a where the sum takes 
into account all the surfaces in the room. Theory shows that the 
reverberation time depends on the quantity A and the volume of 


‘the room V, i.e., T= 0.46. In this formula, the volume is ex- 


pressed in cubic metres and the quantity A in square metres. 

By means of these formulas, it is not difficult to calculate the 
reverberation time. The absorption coefficient for concrete is-given 
above; for glass, wood and plaster, it is not much larger (up to 3 per 
cent). A sharp increase in absorption occurs when soft materials 
are brought into the room. Suffice it to note that the clothing of 
One person absorbs as much sound as 20 square metres of wall sur- 
face. For soft materials, the coefficient of absorption varies between 
0.5 and 0.9. A large role in the solution of acoustical problems in 
the construction industry is played by porous materials (e.g., spun 
glass and porous concrete), whose coefficients of absorption approach 
the values of æ for soft materials. 


49. The Atmosphere and Acoustics 


ses from one medium into another, it changes 
ation in accordance with the law of refraction. 
he direction of propagation changes is deter- 
i.e., the ratio of the velocities 


_ When a wave pas 
its direction of propag 
The angle by which t i 
mined by the index of refraction, 
of propagation. 

It was indicated in Sec. 


is sensitive to changes in temper 1 
1°C increases the sound velocity by about 0.5 m/sec. The temperature 


i : rule, different 
of different layers of the Earth's atmosphere has, as a rule, a 
Values. Thus, in different layers of the atmosphere, sound will have 
different velocities. How is the propagation of sound affected by 
the fact that the sound travels in a medium in which the refractive 
index is continuously changing? 


10—1409 


32 that the velocity of sound propagation 
ature. A temperature increase of 
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Let us answer this question by referring to Fig. 71. Assume that 
the sound passes through a series of layers and that in each layer 
the refractive index is constant but changes abruptly from layer to 
layer. The path of the sound wave is represented by the broken line. 
If the thicknesses of the layers are small and the differences in the 
refractive indexes begin to decrease, the broken line will approximate 
a curved line. Thus, in a medium of variable index of refraction, 
sound waves propagate, generally speaking, along curved lines. 
Moreover, the path is always such 
that the wave travels from point 
to point in the shortest time. 
This proposition is known as 
Fermat's principle. A straight 
line, in this case, is in a certain 
sense not the shortest. 

We shall demonstrate the va- 
lidity of this principle for the 
case of two adjacent segments 

Fig. 71 of the broken line just consid- 

ered. Let us assume, for simplic- 

ity, that the thickness d is the same for both layers and that the 
propagation velocities v; and v, are different. The time required 
for a wave to traverse the path indicated in thie figure is equal to 


r= VPP E41) Ge i 
1 2 t 


Here, the time is expressed in terms of the independent variable x. 
For different values of z, the refraction will differ and so will the 
time of travel from the initial point to the final point. The least 


à ? one dt f, 3 3 
time will be taken when the condition a 0 is satisfied, i.e., when 
ya x a—r 


T . e s z mk . 
But Verna is the sine of the incident angle and ——“—*___ ig the 


a— x)? 4- d2 
sine of the refraction angle. This proves that the E of the 
wave occurs in such a manner that its time of travel is a minimum. 
It should be emphasised that this result is valid not only for elastic 
waves but for all undulatory processes. 
„ Thus, a wave travelling in a nonhomogeneous medium changes 
its direction in such a manner that its path is lengthenéd in a medi- 
um in which the propagation velocity is larger and shortened in 
a medium in which the propagation velocity is smaller. In other 


3 
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words, a wave in a layer where the propagation velocity is large 
will tend to travel parallel to the layer, while in a layer where the 
propagation velocity is small it will tend to travel perpendicular 
to the layer. ‘ 

This is clearly illustrated in Fig. 72. Here, the path of a sound 
wave is schematically shown for the case when the temperature 


fa) Day-time 
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of the air decreases with height (usual day-time condition) and for 
the case when the temperature increases with height (night-time 
condition). 

In the former case, the velocity of sound propagation is large in 
the layers close to the Earth. If we trace the propagation of the 
sound wave emanating from a point above the Earth’s surface at 
a small angle with the vertical, the following situation is seen to 
prevail. Each of the successive layers deflects the wave further and 


further away from the vertical. When the angle of incidence 


sini z 
= 1, refraction ceases 


becomes equal to the angle io, for which 


and total reflection occurs. Formally, the reason for total reflec- 
tion is clear, namely, sin i cannot become larger than unity. 
The physical basis of this interesting phenomenon will be consi- 
dered below (Sec. 128) in connection with electromagnetic waves. 
In any case, instead of being propagated along the Earth's 
surface, the wave is turned in an upward direction. The diagram 
makes clear how “zones of silence” are formed. At night, the path 
of a sound wave is turned convex upwards. As a result, audibility 


10* 
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at night is much better than during the day. When sound propagates 
over a reflecting surface (a still body of water), it can be heard for 


a9 


Fig. 73 


several kilometres even when its intensity is relatively low. The 
path of such a wave is represented by a series of consecutive convex 
ares (Fig. 73). 


50. Ultrasonics 


The vibratory energy in a unit volume of a sonic field is proportion- 
al to the square of the frequency. Thus, the density of the vibrato- 


ry energy is w=% , but the amplitude of the velocity is uo = Aw, 


so that w is proportional to œ°. A powerful ultrasonic source is capa- 
ble of producing vibrations with a pressure amplitude of dozens 
of atmospheres. This means that in small volumes of matter, we 
go through the following cycle several thousand times per second: 
up to dozens of atmospheres of compression, down to zero, then up 
to dozens of atmospheres of expansion, etc. 

It is evident that a powerful mechanical action of this nature 
may have a number of specific effects. One such effect is cavitation. 
At the instants of vibration corresponding to maximum expansion 
in a liquid located in an ultrasonic field, microscopic explosions 
occur and dissociated gases and steam rush into this region. At the 
instants of vibration corresponding to compression, tremendous pres- 
sures of the order of thousands of atmospheres are produced in the 
regions of these explosions. 

This powerful force may be used to overcome the forces acting 
between molecules. Emulsions such as fat in water and benzene in 
water become dispersed under ultrasonic action. Sooner or later 
cavitational explosions occur in the suspended particles. This disin- 
tegrating action has found wide application in industry. 

However, ultrasonic action may be of considerable importance 
even when cavitation does not occur. Thus, if an ultrasonic wave is 
passed through aeorsol (a suspension of solid particles in a gas, e. g., 
smoke), the particles are precipitated out. The vibrations cause the 
solid particles to gather at the sound pressure nodes, where the 
particles merge and become sufficiently heavy to fall to the ground. 

‘Finding blowholes, internal cracks and other defects in metals by 
means of ultrasonic irradiation is another important field of appli- 
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cation. The method is based on the reflection of such a wave from 
the boundary between the medium and air, i.e., between the metal 
and the inclusion. Only if the dimensions of the defect are greater 
than a wavelength will the method work. In order to detect a defect 
having a dimension of 1 mm, the wavelength should be less than 
0.1 mm, i.e., a frequency of the order of 10° cps. The frequencies 
used are usually much lower (10° cps), the method being employed 
to detect large flaws. 

As is well known, ultrasonics also finds application in echo sound- 
ing and underwater location. 


CHAPTER IX 
TEMPERATURE AND HEAT 


51. Heat Equilibrium 


When all the properties of a body remain unchanged, we say 
that the state of the body has not changed. On the other hand, when 
some property of the body changes, its state changes. The state of 
a body may be changed by doing work on it. However, the same 
results may be achieved without using mechanical means. Water 
is heated by intensely stirring it or by placing it on a gas burner. 
Heat exchange is said to take place when the external medium or 
surrounding bodies act on a body or system of bodies under consid- 
eration so as to change the state of this body or system of bodies 
by nonmechanical means, 

If there is no heat exchange between the bodies, the bodies are 
in thermal equilibrium and have the same temperature. The presence 
of thermal equilibrium can be directly verified by bringing the 
bodies into contact with each other: whereupon the states of the 
bodies after contact should not differ from the states before contact. 
However, heat exchange is also possible when bodies are far apart. 
Thermal equilibrium may be detected, in this case, by means of 
a third body acting as a thermometer. If the thermometer is in 
equilibrium with both bodies, the temperature of these bodies is 
the same. This means that they would also be in a state of thermal 
equilibrium when in direct contact. By means of a “third body”, 
a thermometer, it can always be ascertained whether bodies have 
equal or different temperatures. 

By means of a thermometer, we can establish not only whether 
thermal equilibrium prevails or not, but also the extent of a partic- 
ular deviation from equilibrium. To obtain a suitable thermometer, 
it is first necessary to agree on the type of thermometer (mercury, 
alcohol, water or gas) and the property (indication) by which we 
shall judge whether thermal equilibrium has been achieved between 
object and thermometer. As always in physics, it is important to 
agree on what instruments, in this case thermometers, will be con- 
sidered primary. A thermometer can, then, always be calibrated 
by means of a standard. Gaseous hydrogen is the material used in 
a standard thermometer and the gas pressure p is the property by 
which the temperature is determined. The temperature of a body 
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is taken as proportional to the hydrogen pressure in a gas ther- 
mometer at the constant.volume occupied by the hydrogen. 

The temperature scale is selected as follows. We call the temper- 
ature of the melting point of ice 0° and that of the boiling point 
of water 100° (at a pressure of 760 mm of mercury). Measuring the 
hydrogen pressures po and poo at these two points, and drawing 
a straight line through the plotted points, we obtain the Celsius, 
or Centigrade, scale. The equa- 
tion of this line, shown in 
Fig. 74, has the form 


t = -=P x 100. 
P100 — Po 

The straight line intersects 
the -axis at a temperature 
of —273.4°C. This is abso- -274/% . 0b t 
lute zero. By definition, lower 
temperatures are not pos- 
sible. In physics, we often 
use a temperature calculated on the basis of absolute zero, namely, 
T = t + 273.1°. This is called the absolute temperature or the tem- 
perature in degrees Kelvin (K). 

When we calibrate working thermometers with respect to a hydro- 
gen standard, only a limited interval of temperatures may be used. 
At high temperatures, diffusion of the hydrogen through the walls 
of the vessel may begin to occur. At low temperatures, the hydrogen 
may liquefy. Nevertheless, the adopted method of determining 
temperature has complete general validity as will be shown below 


(p. 160). 
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Fig. 74 


The basic characteristics of bodies in the presence of mechanical 
and heat interaction is very well depicted by the so-called kinetic 
molecular model. A body consisting of molecules is considered as a 
system of moving and interacting particles subject to the laws of . 
mechanics. Such a system of molecules has an energy consisting of 
the potential energy of the interacting particles and their kinetic 
eae of motion. This energy is called the internal energy of the 

ody. 

Apetite internal energy corresponds to a specific state of the 
body. Changes in the mutual disposition or character of particle 
motion are related to changes in internal energy. Irrespective of the 
means employed to increase the internal energy of a body, the sur- 
rounding bodies must transfer energy to the molecules of the body 
under consideration. If the body is subjected to mechanical action, 


ià 
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the energy transfer occurs in a regular manner. In the case of heat 
exchange, the energy is transferred through chance impulses trans- 
mitted now to one, now to another molecule. 

The quantity of energy transferred to a body by mechanical means 
is measured by the amount of work done on the body. The quantity 
of energy transferred through heat exchange is measured by the 
quantity of heat. 

Since the exact calculation of the internal energy of a body is very 
difficult and in most cases impossible, and since the very conception 
of internal energy as a purely mechanical quantity is only a rough 
one, a clear determination of this quantity is necessary. This can 
be done by considering processes occurring without heat exchange 
with the surroundings, i.e., so-called adiabatic processes. Adiabatic 
conditions can be provided by using a thermally insulated container 
for the experiment and taking the measurements during short inter- 
vals of time (so that the heat does not have time to “escape” from 
the volume under study). Numerous experiments leading to the 
establishment of the law of conservation of energy show that, irre- 
spective of the means employed to change the state of a body in such 
a process, the amount of work required is exactly the same in each 
case. The magnitude of this work, A, is equal by definition to U, 
the increment of the internal onergy of the body: 


A=U,—~U, 


Naturally, the absolute value of the internal ener: 
mined from the experiment. 

If the mechanical model of a body were a completely faithful 
representation, the above expression would be a simple consequence 
of the law of conservation of mechanical energy. However, the kinet- 
ic molecular model is only a model and, therefore, the fact that 
there is a specific energy corresponding to each state of a body, so 
that the difference in energy between two states is equal to the 
adiabatic work of transition, represents an extremely important 
law of nature leading to the law of conservation of energy, 

Heat exchange and mechanical action can lead in a number of 
cases to the same change of State, i.e., to the same change in the 


internal energy of the body. This enables us to equate heat and work 
by measuring the quantity of heat in the same units as work and 
energy. 


gy cannot be deter- 


, To obtain an idea of the magnitudes of various internal energies, let us 
cite some figures, 


When the temperature of 1 gm of water is raised by 


1°, the energy in- 
creases by » the energy in 


1 calorie ==0.427 kg-m= 4.18 x 107 ergs = 2.61 1019 ey, 
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In this, case, the average increase in energy of a molecule is 
3 x 10-23 calorie = 1.28 X 10-23 kg-m =1.25 x 10-15 erg = 7.83 X 10-4 ev. 
The internal energy given up by matter in the combustion of 1 gm of coal 


amounts to 
7,000 calories = 2,990 kg-m = 2.93 x 1011 ergs = 18.3 x 1022 ey. 


Calculating this on the basis of one atom of carbon, this figure reduces to 
1.4 10-19 calorie =5.98 x 10-20 kg-m =5.86 x 10-12 erg = 3.66 ev. 
The energy released in the nuclear fission of 4 gm of uranium-235 is 
2.03 x 1010 calories = 8.65 x 10° kg-m= 8.49 x 1017 ergs = 5.29 x 1029 ey. 


Calculating this on the basis of one atomic nucleus, the internal energy given 
up amounts to 
7.9 x 10-12 calorie =3.38 X 10-12 kg-m=3.3 x 10-4 erg = 
= 206 x 105 ey = 200 Mev, 


which is more than 50 million times greater than the energy of chemical re- 
actions. 


53. The First Law of Thermodynamics 


In the most general case, when energy is exchanged with the 
medium or with surrounding bodies, the system under consideration 
may receive or give up a quantity of heat Q and perform or have 
performed on it a given quantity of work A. Heat and work are the 
two forms in which the energy of a body may be transmitted to the 
medium or, conversely, the energy of the medium may be transmitted 
to the body. ‘The law of conservation of energy excludes the possi- 
bility of any loss in the energy exchange. The difference in the ener- 
gies of the system for the two states must equal the sum of the heat 
and work obtained by the system from the surrounding bodies. 

This proposition could not be subjected to experimental verifica- 
tion if we did not add that the incremental energy due to the tran- 
sition from one state to another is always the same irrespective of 
the character or method of transition from the initial to the final 
state. It is precisely this provision that embodies the law of conser- 
vation of energy. Now, it can clearly be subjected to all-sided exper- 
imental verification: by measuring the heat and work imparted 
to the system for various transitions from a particular initial state 
to a particular final state. The incremental energy in all cases will 
be identical. £ ` 

The law of conservation of energy expressed in the above concrete 
form is called the first law of thermodynamics. This very important 
law of nature was established as the result of the work of a number 
of scientists in the middle of the last century. The roles played by 
Robert Mayer, Joule and particularly Helmholtz rank especially 
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high on the list. As physics developed, the relationship of the first 
law of thermodynamics to the more restricted law of conservation 
of mechanical energy and the more general law of conservation and 
transformation of energy became ever clearer. During the early 
development of physics, there was a tendency to equate the law of 
conservation of mechanical energy with the first law of thermodynam- 
ics. The growth of science has shown that such a simplified view 
is erroneous, that many forms of energy do not reduce to mechanical 
energy. It was noted by Engels that the law of conservation: and 
transformation of energy, which indicates that any form of motion 
may be transformed into any other, subject to the preservation of 
certain quantitative relationships, is the most general law of exist- 


ence of matter. The first law of thermodynamics is a concrete expres- 
sion of this law. 


In order to write the fir: 
a formula, we must first 
work. Let us assume that | 
system and work is posit 
action of external forces, 
be written in the form 


st law of thermodynamics in the form of 
agree on the choice of sign for heat and 
heat is positive when it is imparted to the 
ive when a body performs it against the 
The first law of thermodynamics may then 


AQ= dU + AA, 


i.e., the heat applied to a body goes to change the internal energy 
and perform the work of the body. Naturally, each of the quantities 
entering into the equation may be positive or negative depending 
on the particular transformation being considered. 

It is not accidental that in writing the above equation the differ- 
ential sign was used only for the energy.. Work and heat are not 
total differentials. When a body goes from one state to another, 
the work and heat received or given up by the body depend on the 
“path” traversed, and only the energy increment, as in the case of the 
total differential of a function, does not depend on the manner of 
transition: 


2 


| w=0,—0,. 
1 


1 knowing anything about the nature of a par- 
ticular process except the initial and final States of the system under 
consideration, a number of important conclusions may be drawn. 
For example, assume that molecules A and B are united by a chem- 
ical reaction to form the molecule AB. Assume, further, that Ua: 
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Up and Uap, the internal energies of the molecules, are known. 
If Uap is greater than U4 + Ups, we are able to predict that the 
reaction will proceed with the absorption of heat and the quantity 
of heat will be equal to Q = Uas — (Ua + U»). Or, if we know 
U, and Up, and measure the heat of the reaction by means of 
a calorimeter, we can determine U3. These data can then be used 
to predict the course of some other reaction involving the com- 
pound AB. 


54. The Internal Energy of Microscopic Systems 


Naturally, the law of conservation of energy and the principles 
of energy exchange are valid for large bodies as well as for the indi- 
vidual particles of a body. However, when considering particles 
(nuclei, atoms and molecules), or systems consisting of a small 
number of particles, one other important law of nature must be 
taken into consideration, namely, the energy of a microscopic system 
cannot assume any arbitrary value. Each system has a sequence 
of possible values of internal energy, U1, Us, ..-, that is character- 
istic of it alone. Figure 214 (p. 487) shows the possible energy levels 
for a hydrogen atom. Similar diagrams may be drawn for the energy 
levels of other atomic systems. When imparting heat or work to 
a system, the energy of the atoms, molecules or other microscopic 
systems may increase only by specific, discrete amounts (quanta). 
In exactly the same manner, energy is given up to surrounding 
bodies in quanta. 

Strictly speaking, the law of the quantum character of energy 
and the existence of a “scale” of possible energy levels for each mi- 
croscopic system is a perfectly general law of nature that is valid 
for large bodies as well. However, as is shown in theoretical physics, 
the number of energy levels in a large body consisting of n atoms 
ee roughly speaking, n times the number of energy levels in a single 
atom. 

As the energy increases, the intervals between levels become small- 
er and smaller (see the diagram for hydrogen). The reduction in the 
interval between ‘energy levels is incomparably more rapid for 
a large body than for an individual atom and only the very lowest 
levels appear discrete. The higher levels merge and it appears as 
though a large body can change its energy continuously. If energy 
is taken away from a body, it “descends” to a lower level. Hence, 
the lower the temperature of a body, i.e., the closer it is to absolute 
zero, the sharper the quantum character of the energy changes. 

Mechanical action serves to shift the energy levels of a body or 
system, but in the overwhelming majority of cases this displacement 
cannot be observed. For microscopic systems—atoms and mole- 
cules—the effect of pressure is very small. 
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Thermal interaction consists in the transitions of a system from 
one energy level to another. 

Thermal equilibrium is mobile equilibrium. A body does not have 
a single energy all the time, but is rather continuously exchanging 
energy with its surroundings, so that on the average the energy 
remains unchanged. The exchange of energy occurs in discrete amounts 
or quanta. If at one instant the energy is equal to Uy, the next 
instant it has changed abruptly to (Gis AE 

Energy is given up in the form of radiation. If Uı > Uz, then 
U, — U» = hy, where v is the radiation frequency and h is Planck's 
constant. This constant is equal to 6.62 x 10-27 erg sec. Energy may 
be gained by absorbing radiation or as the result of a mechanical 
impulse from some particle. 

If the temperature of a body drops instead of remaining constant, 
this signifies that the number of transitions from higher to lower 
levels exceeds the number of transitions in the reverse direction. 


The energy decreases in jumps and the body emits one quantum of 
radiation after another. 


The diagrammatic representation of energy exchange was origi- 


nally Carried out only for atoms. Somewhat later, it became evident 
that this representation has general validity. We refer the reader 
to Part IIT of this book for further details on this subject. 


55. The Equation of State 


Three basic properties or parameters of st 
the various properties of a body. These 
ume v and the temperature 7. Knowi 
sufficient to exhaustively describe 
various substances, 


ate may be selected from 
are the pressure p, the vol- 
ng these parameters is not always 
a body. If a system consists of 
we must also know their concentrations. If 
a body is located in an electric or magnetic field, we must know 


the intensity of the field. However, it is always possible to select 


a group of parameters that will uniquely determine the state of 
a body. The oth 


€ E er characteristics may then be calculated from the 
basic parameters. 


Leaving electromagnetic fields out of consideration and restricting 
ourselves to simple 


selves Systems—gases, liquids and isotropic solid 
bodies —it turns out that only two parameters determine the state of 
abody. It is Immaterial which pair of parameters are selected from 
P, fand T. Usually, v and T are selected. The pressure p is then 
mA fùnction of v and 7. We call the equation 


portance in physics to be 
y and, in particular, for 


EWO 


a, 


— 
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a class of bodies. Equations of state may be established only exper- 
imentally. The nature of the dependence of pressure on volume 
and temperature for liquids and solid bodies varies tremendously 
from case to case. By establishing the equation of state for a given 
body, we are able to determine its behaviour under a variety of 
conditions, but this does not in any way enable us to determine 
the behaviour of other bodies. 

Quite often, the behaviour of a substance is not described by an 
equation of state, but rather by the derivatives of certain of the 
parameters with respect to the others. s 

To establish how a body expands with increasing temperature, 
at constant pressure, we must determine (=) 3 (the derivative of 


v with respect to T at constant pressure). The quantity 
=< ( ðv ) 
ets ôT }p 
is called the thermometric coefficient of dilation. As can be seen 


from the formula, æ gives the relative change in the volume of 


a body for a 1° change in temperature. 
The thermometric coefficient of change of pressure 


ys (32) 

"T eN OLN io m 

gives the relative change in the pressure for a 1° change in tempera- 
ture (at constant volume). The dimensionality of the coefficients 


æ and ĵ is given in reciprocal degrees. ee 
The third useful quantity is the compressibility 


3 i} (=) 
Saas op İT’ 


which gives the relative decrease in volume for a unit increase in 


pressure (at constant temperature). i . 
These three coefficients are connected by a very interesting rela- 


tionship, which is easily derived as follows: Since 


p=! T), 
then 4 4 
P diy (ee ; 
a= (F), 00+ (Et 
If the pressure is constant, then dp = 0 and ac 
ap ar avy _ : 
(2 AG aor ly 1. 
Hence 
Bx _ 1 
Gia ee © 
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This result shows that if we know, for example, the com pressibility 
and the thermometric coefficient of change of pressure, we can cal- 
culate the value of the thermometric coefficient of dilation. The 
derived relationship is valid for all bodies. 

Generally speaking, the coefficients œ, B and x are not constant 
quantities for a given substance. For various pressures and tempera- 
tures, these coefficients may assume various values. Hence, when 
the value of a coefficient is given, it is necessary to indicate the 
corresponding values of pressure and temperature, In some cases, 
the average value of the coefficients is given over a particular inter- 
val of temperatures or pressures. 


Here are some examples: 


a) The table gives the thermometric coefficient of dilation œ and the com- 
pressibility x for some liquids. 


a (deg-1) % (atm~1) 


Water, 10°-30°C, normal pressure 


2.07% 10-4 47.5 10-6 
Mercury, 40°-300C.. e a a 1.81x10-4 2.95Xx10-6 
Hither OCTA ES A a a 16.56 10-4 146 10-6 


For solid bodies, the thermometric coefficient of dilation 
sibility may vary considerably. Thus, at normal temperature and pressure, 
for fused quartz, œ = 1.29 X 10-6 deg! and x = 2.7 x 1076, while for 
ebonite, œ = 77 X 1076 deg! and x = 18 x 10-%) 

b) The values given in the table for B, the thermome 
of pressure, are calculated for water, 


oe Bu ) 
pressure (= +1]. 


and the compres- 


tric coefficient of change 
mercury and ether at atmospheric 


| Water | Mercury | Ether 
B (deg) fede i | 44 | 61.4 11.3 


This means that when the temperature 


; 3 of a constant volume of mercury is 
increased by 10-3 q 


egree, its pressure increases by 6 per cent (!). 


56. The Equation of the Gas State 


The simplest equation of state is that of 


obtained by Mendeleyev in th 
bines Cl 


a rarefied gas. It was 


; ; e form of a single formula that com- 
apeyron’s equation and Avogadro’s law. Clapeyron’s equa- 
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. Dv . . 
tion states that L is a constant for a given mass of gas: 


T> = const. 

Avogadro’s law states that gram molecules of different gases at 
constant temperature and pressure occupy equal volumes (22.44 
litres at a temperature of 0°C and a pressure of 1 atmosphere*). 
Hence, for one gram mole, the constant in Clapeyron’s equation 
is a universal constant. It is designated by the letter R and is called 
the unjversal gas constant. For a mole of any gas, the equation 
assumes the form 

pv = RE. 


Here v is the volume of a mole of gas. The constant R has the dimen- 
sions of work per mole degree. Expressed in various unils, its value is 


€ ihr gS _. g 94 joules a 
R=8.31 x 10 ET = 8.31 ag 


atm litres _ 9 calories 


mole deg  ~ mole deg ` 


=0.0821 


Since the volume of an arbitrary mass of gas is V = wv, where 
u is the number of moles, the equation of state for a rarefied gas 
assumes the following form in the most general case: 


pV =pRT or pV = =, RT. 


Here m is the mass and M is the molecular weight. 
This equation yields the following convenient formula for the 
gas density p: 
_ Mp 
RRAN 


Gases obeying the equation of the gas state are called ideal gases. 


The simplicity of the equation would in itself be sufficient grounds 
for fore S HORON as we shall see below (p. 193), this 
equation may be derived by assuming the gas to be represented 
in an ideal system. An ideal gas is then a system of molecules whose 
dimensions and forces of attraction may be neglected. 

For ideal gases, the coefficients of dilation, change of pressure 
and compressibility are given by the following simple formulas: 

1 — 

= p > a yr% ET . 


—_ 
* A physical atmosphere 
engineering atmospheres). 


is meant here (1 physical atmosphere = 1.033 
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At a temperature of 0°C (T = 273.13°) 
a=b- deg™ = 0.00366 deg™t. 


The following data show to what extent certain actual substances 
approach the ideal condition: 


a at V = const 


ERGOTOR E A Ase E 3,660 x 10-6 
It) AAE EAE Se 3,660 x 10-6 
Nitrogen eet on tee a 2s s 3,674 10-6 
Carbon dioxide ....... 3,726 x 10-76 
ANGE oye TRA a ae 3,674 % 10-8 


Gaseous substances under a pressure that considerably exceeds 
atmospheric cease to obey the formulas of an ideal gas. Calculations 
may lead to errors of several per cent at pressures of only tens of 
atmospheres, i 

An important conclusion to be drawn from the study of rarefied 
gases is that, generally speaking, any of these gases—and not only 
hydrogen—may be used as a basis for determining temperature. 
Hydrogen provides no particular advantage over other rarefied 
gases. It can, therefore, be said that the temperature scale adopted 


in physics is not a hydrogen scale, but rather a scale of pressures ` 


for an ideal gas. Herein lies the advantage of the method adopted 
for determining temperature, namely, the existence of a large class 
of substances leading to temperature scales that are identical. The 
kinetic molecular basis for the selection of the temperature scale 
will be given below (p. 193). 


57. The Equations of State of Actual Gases 


_ The equation of the gas state begins to yield very rough results 
for gases at high pressures, steam. close to saturation and a number 
of other cases. Other equations of state are, therefore, required for 
such cases. Some are determined experimentally, while others (the 
most well known being Van der Waals’ equation) have, qualita- 
tively, a theoretical basis. In any case, the validity of one or another 
equation can be established only through comparison of the results 
calculated by means of the equation with the results obtained exper- 
imentally. We shall now give some examples of equations of state. 


7.) 
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Naturally, the simplest correction that can be introduced into 
the equation for ideal gases takes account of the volume of the gas 
molecules. It is evident that a gas cannot be compressed to zero 
volume even if the pressure is infinitely large. Hence, the equation 
of state may be written in the form 


p(v—b)=RI, 


where b is a constant that takes account of the finite volume of the 
molecules. 

The greater the number of constants introduced into the equation 
of state the easier it is to achieve close agreement between experi- 
mental and calculated values, but the more difficult it is to predict 
changes by means of the formula. Excellent agreement with experi- 
ments over a broad interval of values for the parameters of state 
is obtained by means of the formula proposed by Beatie and Bridge- 
man. It contains five constants—A, B, a, b and c—descriptive of 


the substance: 


pata) 4 B)— A 


where 
e 


7 b 
A’=A(1—-<), B =B (1—7); e=: 
Dieterichi’s formula contains three constants—a, b and s: 
a 


ERTIY: 


p(v—b)=RTe 3 


Van der Waals’ equation contains two constants—a and b: 


(o+) (v—b)=RT. 


The merit of the last equation is that it correctly reflects the general 
character of the dependence between the parameters for all gaseous 
substances. However, for a given substance, it is not possible to 
select constant values for a and b in such a manner that the calcula- 
tions agree closely with measurements over a broad interval. 
Van der Waals’ equation is based on the following: The pressure 


: RT y 
satisfies the equation of the gas state, i.e., p=- >: W hen the forces 
of attraction between the molecules are neglected. But due to the 


mutual attraction of the molecules, the pressure on ae of 
a vessel should decrease by some value p’. Thus Pea ail 


olume of the molecules into consideration, 


Now, taking the finite v 


p= 25-8 or (p-+p’)(v—5) = RT. 


11-1409 
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a > 
Why does p’ = ae Here, we reason as follows: Let us consider the 


gas volume divided into two parts. One part, then, attracts the 
other. The forces of attraction are proportional to the number of 
molecules in the left-hand part and to the number of molecules in 
the right-hand part. In other words, the forces of attraction are 
proportional to the square of the density, i.e., inversely proportional 
to the square of the volume. 

The forces of attraction between molecules will be discussed in 
more detail in Part III. 


CHAPTER X 


THERMODYNAMIC PROCESSES 


58. Graphical Representation 


If two parameters of state are given for a body, the third may 
be calculated by means of the equation of state. Thus, when one 
parameter (e.g., pressure) is plotted along one axis and a second 
parameter (e.g., volume) is plotted along the other’ axis, the state 
of the body is uniquely described by a plotted point. 

To be sure, it has been assumed in the graphical representation 
that the state of the body is in equilibrium. Only then will the 
values of the parameters of state be the same throughout the volume 
of the system and are we justified in speaking of the temperature, 
pressure, density, etc., of the body (or system) as a whole. . 

The question arises: What kind of process can be involved if 
equilibrium states are being considered? The answer consists in the 
following: In a process that proceeds sufficiently slowly, the values 
of the parameters of state throughout the volume are equal. Such 
a process may be considered to be a continuous succession of equi- 
librium states. It is reversible and may occur in either direction. 
A process consisting of a succession of equilibriums may proceed 
from state Z to state 2 and then from state 2 back through all the 
intermediate states to state 1, without producing any changes in 
the surrounding medium. s 
_ A reversible process is an idealised process. Every actual process 
is in one way or another irreversible, depending on how far away 
the intermediate states of the process are from equilibrium. 

This becomes clear from the following reasoning. Every establish- 
ment of equilibrium is irreversible. Many simple and familiar exam- 
ples can be cited: the cooling of a body by placing it in a cooler 
surrounding, the “dissipation” of a mechanical deformation, e.g., 
the restoration of a compressed spring to its undeformed position 
upon being released, the spontaneous intermixing of two gases, etc. 
Reversible processes cannot proceed of themselves. They cannot 
be single processes occurring in a closed system. 

An actual process does not consist of a succession of equilibriums. 
Inevitably, such phenomena occur as those enumerated above. Hence, 
when the process is made to proceed in the reverse direction, it will 
never pass through exactly the same states as during the forward 
direction. When a gas is rapidly compressed, the pressure of the gas 
in layers close to the piston will be higher than elsewhere. On the 

ate 
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other hand, during the reverse process—expansion of the gas—the 
pressure near the piston will be lower than elsewhere. 

Nevertheless, in spite of the fact that reversible processes are an 
idealisation, they are of great interest since in many cases the differ- 
ence between actual and reversible processes is insignificant. It all 
depends on the relaxation time, i.e., the time it takes for equilibrium 
to be established. This time varies within very broad limits—from 

~10- sec, the time it takes for the 

2 pressure to equalise in a homogeneous 

gas, to minutes, hours, or even weeks, 

for processes involving heterogeneous 
substances. 

Let us assume that a gas is com- 
pressed and the entire process takes 
one second. The relaxation time is 
an insignificant fraction of a second. 
We may therefore consider the actual 
process as a succession of equilibrium 
states and draw the curve on a graph 

Fig. 75 showing p and v, or on some similar 

diagram. The same holds true for all 

other processes in which the relaxation time is small with respect 
to the duration of the process. 

Fig. 75 shows several curves representing simple processes. The 
coordinates of the graph give the pressure and the volume. In engi- 
neering thermodynamics, other coordinates are used in addition 
to these, but we need not discuss them. The vertical line / in the 
figure represents a process at constant volume. If the point generating 
the curve is moving upwards, the pressure is increasing; if the reverse 
is true the pressure is falling. It is clear that a change in tempera- 
ture occurs during this process that is not “seen” on the diagram. 
The horizontal line 2 represents a process at constant pressure (iso- 
baric process). Moving from left to right signifies expansion, while 
the reverse corresponds to compression. The curve designated by the 
figure 3 corresponds to an expansion accompanied by a drop in pres- 
sure, while curve 4 represents an expansion in spite of the increasing 
pressure. The change of temperature in any process may be calcu- 
lated by means of the equation of state. : 

In most thermodynamic processes, all the parameters of state are 
changing simultaneously. Nevertheless, a number of simple but at 
the same time practically important exceptions may be singled out. 
These include the processes mentioned above, namely, at constant 
volume (isochoric) and at constant pressure (isobaric), as, well as 
the processes occurring without heat exchange (adiabatic) and at 
constant temperature (isothermal). 
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59. Work and Cycles 


In mechanics. work is usually represented as the product of 
a force and a distance. In thermodynamics, we are usually interested 
in the work of changing the volume of a body. Fig. 76 shows the 
shape of a body in two states. The volume is shown to have changed 
from v; to vy. The total work of changing the volume may be consid- 
ered as the summation of the work expended in displacing the ele- 
ments of area dS by the distance dl. If the applied forces are perpen- 
dicular to the surface, the work of displacing a surface element is 
equal to fdl or, in terms of pres- 
sure, pdS dl. Thus, 


dA=pdbv, 


where dv is an element of volume 
change. It is evident that the total 
work is given by the definite inte- 
gral: 


vg 
A= $ pdv. 
i 


In a pressure-volume diagram, the 
work of compression or expansion 


can be represented geometrically. = 
It is simply the area under the curve (bounded on the left and the 


right by vertical lines through the points representing the initial 
and final values of the volume). { 
If the pressure during the process of compression or expansion 
remains unchanged and if, moreover, it is the same at all points 
on the surface, then p may be brought out from under the integral 
sign and the formula for work becomes 
A=p(v2.—"%)- 


As we have already stated, work may be considered positive or 
negative depending on the convention adopted. We have assumed 
that work is positive when a body does work on the surrounding 
medium, i.e., work of expansion. Accordingly, work of compression 
is negative. 

If a body is transferred from state Z to state 2 as a result of some 
process, and then transferred back to its original state via the same 
path, the total work of such a process is naturally equal to zero. 
Here, the work of expansion, performed on external bodies, is equal 


to the work of compression, performed by external bodies on the 


system under consideration. However, the situation is completely 


Fig. 76 
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different when the “forward” and “return” paths differ. Processes in 
which a body is returned to its original state via a different path 
are called cyclical processes. Fig. 77 shows two such cycles and the 
arrows indicate the direction of the processes. One process proceeds 
in a clockwise direction and the other in a counterclockwise direc- 
tion. A process going from left to right signifies expansion. Thus, 
in the clockwise cycle the work of expansion is greater than the 


P P 


U VA 


work of compression. In this case, work is performed on the sur- 
rounding medium. It is evident that in the counterclockwise cycle 
a certain amount of work is performed on the system under consid- 
eration. In either case, the work performed during a cycle is rep- 
resented by the enclosed area (hatched in the figure). 


60. Processes Involving a Change of Gas State 


Let us consider the relations for the four simplest processes involv- 
ing a change of gas state, whereby, in the main, we shall restrict 
ourselves to gases obeying the equation of the gas state. It will be 
presently seen that knowing the equation of state for a substance 
and applying the first law of thermodynamics, a number of valuable 
conclusions may be drawn regarding the behaviour of the body under 


‘various conditions. The first law of thermodynamics for gases will 
be used in the form 


AQ=dU + pdv. 


The Isochoric Process. At constant volume, the fist law of thermo- 
dynamics assumes the form 


AQ=dU. 


Heat exchange occurs between the system under consideration and 
the external medium, but no work is performed on the external 
medium or the system under consideration. Two possibilities exist: 
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either 1) the body absorbs heat from the surrounding medium and 
its internal energy increases Or 2) the body gives up heat to the 
medium and its internal energy decreases. 

The quantity of heat required to increase the temperature of a 
body by one degree at constant volume is called the thermal capacity 
at constant volume and is designated by the letter c with subscript v: 


c= (a7) 
eE NEI v=const ` 


If the dependence of the internal energy of the gas on the temper- 
ature is known, the thermal capacity c» may be calculated. 

At high temperatures, there is a linear dependence between the 
internal energy of a gas and the temperature, since the thermal 
capacity in this case does not depend on the temperature. 

We are unable here to prove an important theorem. If follows, 
however, from the general laws of thermodynamics that if the 
dependence between p and T is linear, then c, cannot depend on 
the volume. Since such a linear dependence exists for gases obeying 
the equation of the gas state and Van der Waals’ equation, then c, 
does not depend on v for gases and the phrase “at v = const” may 
be omitted in the above formula. Thus, 


d 
oe a (for gases). 


If the dependence of c, on temperature is only slight, the internal 
energy of a gas may be represented by the formula 


U = c,T + const. 


For ideal gases, the constant does not depend on the volume and 
may be dropped. For a gas obeying Van der Waals’ equation, the 


a 
constant equals — -7 - Thus, 


U=c,T for an ideal gas 
and 
E o the © for a gas obeying Van der Waals’ equation. 
v 
We see that, in the case of an ideal gas, a change in the volume of 


the gas when the temperature is maintained constant does not involve 
a change in energy (see P- 475). If the molecules are drawn together 


, a z . 
with a force per unit area of p’ = g then upon expansion of the 
gas the energy increases by the amount of work done against this 
force, i.e., by 
A pees 
\p dv= = -+ const. 
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The Isobaric Process. In this process, all three terms in the equa- 
tion for the first law of thermodynamics are different from zero. The 
system exchanges heat and work with its surroundings without 
a change of pressure occurring in the system. This process usually 
involves absorption of heat by a body from its surroundings; however 
not all the heat serves to raise the internal energy of the body and, 
in part, it is returned to the surroundings in the form of mechanical 
work. We shall not consider other cases. 

It is perfectly evident that the thermal capacity in this process 
will differ from that in the isochoric process considered above. In an 
isobaric process, the heat is used not only for raising the temperature. 
Hence, cp (thermal capacity at constant pressure) must be greater 
than c,. The difference may, in some cases, be calculated. 

Let us divide both sides of the equation for the first law of ther- 
modynamics by an incremental temperature: 

_ AQ__ du dv 
oan ar Pap 
This expression for the thermal capacity is valid for any process, 
including the isobaric process under consideration. For gases, this 
formula may be rewritten as follows: 


due 
Cp = co Pap - 


For an ideal gas, the result obtained is very simple. Since pv =uRT, 
then aa and cp = c, + uR. Thus, the difference between the 


thermal capacities at constant pressure and at constant volume is 
equal to the number of moles of gas multiplied by the universal gas 
constant. Then, for molar thermal capacities, 


Cp—G=R. 
Since Ræ 2 cal/mole deg, 
Cp —Cy=2 cal/mole deg. 


The Isothermal Process. In order to avoid confusion, it should 
first be emphasised that constant temperature in no way signifies 
that no heat exchange occurs between the system and the surround- 


ings. A system may absorb heat from the surroundings but not use ~ 


it to raise the temperature. Thus, as is well known, the internal 
energy of a body may increase at constant temperature (e.g., melting 
ice). Moreover, for gas processes, there is another possibility (more 
important than the first): A system may return part of the heat 


received from the external surroundings in the form of mechanical 
work. 


ee 
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In the case of-actual gases, both ways of expending the heat are 
entirely possible in an isothermal process. The heat transferred to 
a gas causes the gas to expand without the temperature being raised, 
whereupon 1) work is performed on the external surroundings and 
2) the potential energy of the interacting molecules is increased. 

In the case of an ideal gas, whose internal energy depends only ` 
on the temperature and therefore cannot change in an isothermal 
process, the first law of thermodynamics assumes a particularly simple 
form. Since dU =0, then AQ = AA. Hence, either the system ex- 
pands, absorbing heat from an external source and performing work 
on some object, or, conversely, the system contracts, releasing heat 
and obtaining energy in the form of mechanical work from external 
bodies. An ideal gas transforms energy in an isothermal process. 
It obtains energy from the surroundings in one form and returns all 
of it to the surroundings, but in another form. 

It is not difficult in the case of an ideal gas to go over from the 
differential form AQ = pdv to the integral form. The work (we 
can just as well say heat since work and heat are equivalent) of 
an isothermal expansion from volume v, to volume və is 


va 


A= \ pdv. 


vi 


Substituting for pressure from the equation of the gas state, and 
bringing the temperature out from under the integral sign since it 
is a constant, we obtain 
v2 
dv ve 
A=pRI f = uRT In. 
v 


It should be noted that the work of equal numbers of isothermal 
expansions at different temperatures differ, being greater the higher 
the temperature. Thus, doubling the volume of a mole of some ideal 
gas at a temperature of 300°K (room temperature) requires 8.31 X 
x 300 X In 2 = 1,730 joules of work, while at a temperature of 
3,000°K. the work required is ten times as much, i.e., 17,300 joules. 

‘An actual isothermal process may be difficult to achieve. In any 
case, in order for the process to be even approximately isothermal, 
the walls of the vessel through which the substance comes in contact 
with the surroundings must be perfectly heat-conducting. Moreover, 
the process must proceed very slowly, so that the heat (or work) 
is able to return to the surroundings in the form of work (or heat) 
instead of accumulating in the system. 

Pe ea ed on, een amd pein 
at exchange with the surroundings. This 
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may be achieved by providing conditions that are in a sense the 
reverse of those for an isothermal process, i.e., perfect thermal insu- 
lation must be provided and the process must proceed very rapidly, 
so that heat is not able to escape from the system or be transferred 
to the system. In the case of compression, in accordance with the 
first law of thermodynamics, which is now written in the form 


pdv= — dU, 


the mechanical work is converted into internal energy of the body. 
In the case of expansion, on the other hand, the work is performed 
at the expense of a decrease in the internal energy of the system 
under consideration. 

For the three processes considered above, the changes occurring 
in the pressure, volume and temperature were quite apparent, and 
for gases followed directly from the equation of state. In an adiabat- 
ic process, the nature of the change in the parameters of state is not 
apparent, since all three parameters of state change. The simulta- 
neous solution of two equations—the equations of the gas state and 
the first law of thermodynamics—enables us to determine the rela- 
tionships. Since only the principle involved interests us, we shall 
restrict ourselves to an ideal gas in order to simplify the mathemat- 
ical calculations. Using the expression for the thermal capacity 


d f 
of a gas at constant volume, a =c,, and replacing the pressure 


by ey , we obtain: — pias = aT . Assume that in the initial state 
P v T 


Cy U 
the gas parameters are vi, py, T, and in the final state V2, Po, To. 
Integrating the last equation from the initial to the final point of the 
adiabatic process, we obtain 


1 


2 2 

uR dv f dr ; HR, ve To 

= (2 To i ln 7 ln T” 
1 


1 


c 
Recalling that Cp — c, = uR and introducing the designation = = 
v 
we obtain 


La (vy \¥-1 

aas 
lt is seen from this equation that for adiabatic compression the 
temperature increases and for adiabatic expansion falls. Various 
examples can be cited. Thus, a gas is rapidly expanded when we 
desire fo cool it and carbon dioxide gas escaping from a gas tank 
may turn into dry ice due to the tremendous drop in temperature 
of the expanding gas. On the other hand, adiabatic compression 
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may be used, for example, to ignite some substance. The following 
experiment is often demonstrated: A wad of cotton wool soaked 
in ether is placed in a vessel containing air. Rapid compression of 
the air by means of a plunger causes the cotton wool to burst into 
flame. 

Since we wish to represent the gas processes on a pressure-volume 
diagram, it is necessary to convert the above equation of the adiabat- 
ic process into an appropriate form. Substituting for the tempera- 
ture by means of the equation of the gas state, we obtain 


Pw) = pav}. 


Comparing this equation with the Boyle-Mariotte law for an isother- 
mal process, important differences may be observed in the nature 
of the pressure change for compression or expansion. For an isother- 
mal expansion or compression, the product pv remains unchanged, 
while for an adiabatic process the product pv’ remains unchanged. 
Since y > 1, the adiabatic on the diagram is steeper than the iso- 
thermal. When the volume is reduced to one-half in an isothermal 
process the pressure doubles, while in an adiabatic process the 
increase is greater. For example, in the case of most diatomic gases, 
for which y = 1.4, when the volume is reduced to one-half, the 
pressure increases to 2.63 times its original value. 

It has already been emphasised that both processes are of an ideal 
character and that the requirements for the creation of the ideal 
Conditions of these processes are opposite. Therefore, it is evident 
that gas processes under actual conditions yield intermediary 
curves between the adiabatic and isothermal curves. 

The difference between the adiabatic and isothermal curves may 
be easily visualised as follows: For adiabatic compression, the gas 
becomes heated, so that for one and the same reduction in volume 
the increase in pressure is greater in the adiabatic process, since 
heating at constant volume leads to an increase in temperature. 

Fig. 78 shows that the work of isothermal expansion is greater 
than the work of adiabatic expansion. On the other hand, the work 
of isothermal compression is less than the work of adiabatic compres- 
sion. We are assuming, naturally, that the initial points of the proc- 
esses coincide. y 2 : 

The work of an adiabatic process may be determined graphically 
or by means of formulas. From the first law of thermodynamics, it 
follows that in adiabatic processes the work must equal the change 


in internal energy: 


A= pdv=U,—U3. 
1 
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In the case of ideal gases, the difference in energy is calculated sim- 
ply as U, — Uz = c, (Tı — Tə). Thus, for ideal gases, the work 
may be calculated by means of this formula. 

Another method may be used to determine the work in an adiabat- 
ic process. Since the equation pwy = pv holds for any intermediate 
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point of the process, where the symbols without subscripts designate 
the current values of pressure and volume respectively, the work 
integral may be written in the form 


Upon integrating from the initial to the final point of the process, 
we obtain 


Be, pw? (==- 
yt (oi 


Naturally, this formula is equivalent to A = c, (7; — Ts), which 
is easily demonstrated by using the equation of state of an ideal gas 


and converting the above formula (taking 
to the form 


pat out of the brackets) 
1 


a= or 1-2)" ] t é 


Depending on the given data, one formula may be more conveniently 
used than the other. á 


Let us illustrate by a simple numerical example the statement made in 
Chapter IX to the effect that the increments AQ and AA (note: AQ ~ AA) are 
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not total differentials, i.e., they do not characterise the change of state ofa 
system. 
Assume state (1) of a mole of hydrogen (Fig. 79) is characterised by the 


following data: vy= 20 litres, T, = 300°K and py EE Sy cal/litre 
= 


© n T . 1 . . 
(here R = 2 cal/deg). Now, cp — ¢ = R and, since hydrogen is a diatomic 


e 
Pp > 1 
gas, —-=1.4. Hence cp = cal/deg and 
n P 


l 2 


Co 
ĉ = 5 cal/deg. 

We shall now consider three possible 
paths by means of which the gas can change 
to state (3), where- v3 = 40 litres, 
T3 = 300°K and p3 = 15 cal/litre. 

Path 1-3. The work along the isotherm 
A-s = RT; In 2 = 416 cal. These 416 


calories are absorbed from the hot body, 
and the internal energy U = const since 
1 = T3. 

Path 1-2-3. Here, (1-2) is an isobar. 
Hence, 7 = 600°K. The heat absorbed 
from the hot body is Qi- = ep (T2 — 11) = 
= 2,100 cal and the work against the Fig. 79 
external forces is Ay-2 = pı (v2 — u) = 
= 600 cal. Therefore, the internal energy 
of the gas increases by AU = 2 100 — 600 = 1,500 cal. Process 2-3 consti- 


tutes isochoric cooling and Qz-3 = ĉ» (T2 — T) = 1,500 calories of heat are 
transmitted to the cold body. Since vg = vg, nO mechanical work is per- 


formed. 5 
Thus, al the path 7-2-3, the hot body gave u 2,100 cal, 600 cal of work 
Senet maa He i E500 cals ‘Along the path 7-3, the 


was performed and the cold body absorbed 1,9 i 
hot body gave up 416 cal, 416 cal of work was performed and no change in 
the state of the cold body occurred. However, the change in the state of the gas 
was the same for both cases. 
Path 1-4-3. Here (1-4) represents an adiabatic process, whille (4-5) 
1 
1 


1 

4 5 y = ene 

is an isobar. Now, pa ee (4+) =2Y, so that v,=20X2% litres. From 
4 

1 


vy 

f yi eT 

fi = (21)! _ we find that 7,=300x2" . Along the 
1 v4 

path 7-4, the work against the externa 


the relation 
l forces is performed only at the expense 
i 


of a decrease in the internal energy: 4—4=— tv (T, —T,)=1,500 (1—2 Y Deal 


AEI 
Along the path 4-3, the hot body gives up Q4-3=°p (T3—T 4) =2,100 (1 LONE 
cal of heat and the work against the external forces equals 4,_3=P (vg—v4) = 
4 
2.4 
= 600(1—2Y ) cal. 


Therefore, along the path 4-3, the internal energy increases just by 


1 
1,500 a277) cal. Thus, the path 7-43 has also not led to a change 
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in the internal energy of the gas, which is uniquely determined by the 
temperature. : 


Measuring the Thermal Capacity of Gases. It would appear that 
the easiest way to determine the thermal capacity of a gas is to fill 
a container with the gas to be measured and immerse it in a calorim- 
eter. However, this does not take into account the fact that the 
thermal capacity of a gas is very small with respect to the thermal 
capacity of a container, no matter what solid material is used to 
make the container. Therefore, the thermal capacity of a gas is not 
measured at constant volume, but rather at constant pressure. For 
this purpose, gas under constant pressure moves in a coiled pipe 
that passes through the calorimeter. By means of a thermocouple, 
the temperature of the gas is measured at the input and output of 
the calorimeter. After preliminary heating, the gas enters the calo- 
rimeter and transfers part of its heat to the water. Knowing the 
quantity of gas passing through the container during a particular 
period of time, and the quantity of heat transferred to the water of 
the calorimeter during the same interval of time, the thermal capac- 
ity of the gas at constant pressure, Cp, can be easily determined. 
This is done by dividing the quantity of heat by the mass of flowing 
gas times the difference in gas temperature between input and out- 
put. " 


To determine the thermal capacity at constant volume, we use 


the ratio of thermal capacities, i.e., Poisson’s coefficient: y = 2 s 
v 

Many methods of determining y have been proposed, some of them 
being based on the measurement of the volume and pressure of the 
gas in a succession of states for an adiabatic process. Other relation- 
ships between the thermal capacities may also be used, e.g., the 
relationship defining the difference between the thermal capacities 
Cp and c,. 


The thermal capacities of various gases are given in the table. 


Gas Cy p Y 
cal/deg mole cal/deg mole 
onti a .). 2.98 5.00 1.67 
Hydrogen H, ..... 4.87 6.87 1.44 
Nitrogen No... a. 5 4.86 6.84 1.44 
Oyye Os 6 eu eh eae 4.99 6.90 1.40 
Water vapour HzO . . . 6.65 8.65 1.34 
Methane CH, ..... 6.51 8.51 1.30 
Ethyl alcohol C,H;0H 18.9 20.9 Ae! 


' 


gi 


61. The Joule-Thomson Process 17. 


61. The Joule-Thomson Process 


This is the process in which gas is allowed to flow through a small 
opening from a region of high pressure p, into a region of low pres- 
sure pə. The vessel in which the process takes place is thermally 
insulated from the surroundings. 

In accordance with the conditions of the process, p, and pẹ must 
not change. This is done by having both pistons (Fig. 80) move to the 
right, corresponding to the pas- 
sage of gas into the region of 
low pressure. 

M, the mass of gas that is 

moved from left to right, does not 
Maintain constant volume, but 
changes from v; to vs, for it has 
entered a region of different pres- 
sure. This transition is accom- 
plished under the action of the 
left piston and the counteraction 
of the right one. The left piston Fig. 80 
does work at the constant pres- 
Sure p. This work is equal to p Av, where Av is the change in the 
volume of the gas to the left of the partition. But v, is the change 
in volume on the left, so that the work done by the left piston is 
Pix. The right piston does negative work, which in this case is equal 
to the product of the pressure ps and the incremental volume Vo. 
Thus, when the mass of gas M is transferred from the left region to- 
the right one, the work performed is piv; — pw. The law of con- 
Servation of energy requires that the internal energy of the gas. 
change by this same amount. Therefore, 


U, — Uy = pp — Pra 


This formula is valid for any mass of gas. This means that in the: 
Process of moving a gas from one vessel into another the quantity 


U + pv =const 


‘(called the heat function or enthalpy) does not change. 


For an ideal gas, U and pv both depend only on the temperature. 
Thus, during a Joule-Thomson process, the temperature of an ideal 
8as does not change. . y 

For actual gases, the situation is different. If the gas is not ideal, 
the temperature may increase or decrease during a Joule-Thomson: 
Process, depending on the nature of the interacting forces between 


the molecules. 
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It is noteworthy that a particular gas may behave differently at 
different temperatures. At a high temperature, the temperature 
increases during a Joule-Thomson process, while at a low temper- 
ature, it decreases. The point of inversion corresponds to the temper- 
ature at which the change in sign ocgurs. This temperature for oxy- 
gen and nitrogen is above room temperature. Therefore, the air is 
cooled during a Joule-Thomson process conducted at room temper- 
ature—not to speak of lower temperatures. For hydrogen, the inver- 
sion temperature is very low. The Joule-Thomson effect below the 
inversion temperature finds application industrially in the lique- 
faction of gases. 


CHAPTER XI 
ENTROPY 


62. The Principle of Entropy Existence 


In the middle of the last century, an important discovery was 
made regarding reversible thermodynamic processes. It was found 
that side by side with internal energy a body has yet another remark- 
able function of state, namely, entropy. Just as in the case of inter- 
nal energy, entropy includes an arbitrary constant. Experiments 
yield the incremental difference in entropy. If a body or system 
absorbs the heat AQ during an infinitesimally small transition from 


one state to another at a temperature T, the ratio a is a total differ- 


ential of some function S. This function is the entropy and is thus 
determined by the two following equiyalent equations: 


2 
AQ _ Ag 
aS =** and Sim Ve 


i SANON 
The statement that a function exists whose differential is =æ is 


known as the principle of entropy existence. It is one of the most 
important laws of nature and an essential part of the second law of 
thermodynamics, which will be discussed below. The discovery of 
this principle, as well as the entire second law of thermodynamics, 
is primarily associated with the names of Carnot and Clausius. 
n spite of its somewhat abstract nature, the essence of the principle 
is easily understood and may be summarised as follows: A body may 
change from one state to another in an infinite number of ways (re- 
Presented on a diagram by the various curves connecting the same 
initial and final points); and, although the body may Mba various 


eae ; AQ _ ey) « 
amounts of heat during such transitions, the integral en will in 
1 


A : z 
all cases have the same value. = , the ratio of the quantity of heat 


to the temperature at which this heat was absorbed, is sometimes 
Called the P aduca heat. Since an integral may always be approxi- 
mately represented as a summation, the change of entropy in trans- 
ferring from one state to another is equal to the summation of the 
reduced heats. Let us assume that the body absorbs a calorie per 


12-1409 
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degree as it is uniformly heated from 20°C to 25°C. The increase 
in entropy is then 

S Sıx Aea , teal , teal , deal , 4 cal 

Sis pa2D32De 204.5% © 295.59 © 296.5" "297.5" * 


For isothermal processes, the expression for the change in entropy 
is very simple: 


SS=, 
where Q is the heat absorbed during the process. Thus, when 4 kg 
of ice melts the entropy of the substance increases by eusa 
= 0.29 H, 


In applying the concept of entropy, the value of entropy at any 
state (e.g., boiling water or melting ice) may be adopted as zero 
entropy. However, in some cases, the value of the entropy of a sub- 
stance at absolute zero is adopted as zero entropy. There is, inciden- 
tally, some theoretical basis for this (Nernst’s theorem), but we 
shall not go into it. 

Assuming S = 0 at T = 0, the entropy of a substance at the 
temperature T may be determined by the formula 

T 


+ cy aT 
sa (at, 
0 


wr, 


if the heating occurs at constant pressure. As can be seen, the depend- 
ence of the thermal capacity on the temperature must be known in 
order to determine the entropy. 

The entropy may be easily calculated (except for the arbitrary 
constant) if the equation of state of the substance is known. By defi- 
nition, dS =e. Substituting the value for AQ obtained from the 
equation for the first law of thermodynamics, we obtain 


aT dv 
dS = Cy + PF 
By means of the equation of the gas state, we can eliminate the 
i i ini ar dv mys 
pressure from this equation, obtaining: dS = tyr +R z . Taking 
the indefinite integral, we obtain an expression for the entropy that 
includes an arbitrary constant: 


S=c,In7+pRInv + const. 
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It is also possible to take the definite integral of dS, where the 
limits are two states. We then obtain the following’ expression for 
the entropy difference between the two states: 


T2 Dy ? 

Sa — Si =C In T, uR ln T 

This is the expression for the entropy of ideal gases.- It is seen from 
the formula that the entropy increases as the temperature and the 
volume of the gas increase. Naturally, this is in agreement with the 
general statement that the entropy increases when heat is trans- 


ferred to a body. 


Example: Using the example on p. 173 (Fig. 79), we shall show that the 
entropy is indeed a function of the state of a system: 

Path 1-2-3. The change of entropy 
cal 


p i Ts v2 5 In 2-4-2 1n 2=7 In 2 ——_— 
S2— Sı =c Ine + Fla oid elie ln a ole deg * 


t 1 5 In 2 cal 
The change of entropy S3s—S2:=5 In > ° iD “hole deg ` 


A cal 
change of entropy along path 1-2-3 is S3—Sy=2In2 ole deg ` 


The total 


cal 


D, x =f 2——_. 
Path 1-3, Sg—Sy=2 ln ? olo dog 
Path 1-4-3. Since (1-4) is adiabatic, S,—S,=0. 


5 
Ie, 1 
LOY e ee jt Ses 
+21n2 =21n2 ole deg” 


ao 


1— 
Ts 4 91n 8 =5In2 
S3—S,= cy In T; +21ln ae n 
It is seen, indeed, that no matter how the transition of the gas from state (1) 
to state (3) is effected the change of entropy is the same. 


63. The Principle of Increasing Entropy 


As already stated, reversible processes, strictly speaking, do not 
exist. However, many processes occur that do not, practically, 
differ from reversible ones. But there are some processes that are 
always unidirectional and as a result can never be made reversible. 
Thus, gas may expand of itself, but it cannot be compressed without 
the application of an external force. Heat may spontaneously pass 
from a hot body to a colder one, but it can pass from a cold body to 
a hotter one only if work (e.g., electric energy) is expended. In the 
Presence of friction, the kinetic. energy of macroscopic motion is 
always converted into internal energy, but the reverse process never 
occurs spontaneously. All other irreversible processes are in the 
final analysis based on the fact that in each of them, to one degree 

42% 
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or another, one of the enumerated unidirectional processes occurs. 
In actual processes, it is impossible to avoid spontaneous expansion, 
friction and thermal dissipation. 

Do not all the enumerated unidirectional processes have a common 
characteristic? As a matter of fact they do: in all unidirectional 
processes, the entropy increases. 

In the case of heat exchange between two bodies, the overall 
change in entropy of the entire system is S, — Si Ft Fe 
where Q; is the heat absorbed by the colder body and Q% is the heat 
given up by the hotter body. 


If T, is greater than T4, then Q, = — Q, > 0, Since heat trans- 


ferred to a body is considered positive. Hence, S — S, = 
=Q; (z-z) >0, i.e., during heat exchange, there is an increase 


in the overall entropy of the system in which the heat exchange 
occurs. 

Let us take another case. Assume intensive mechanical motion 
{e.g., rotation of a wheel) takes place in a vessel containing gas. 
The volume does not change, but the temperature increases and, 


th 
hence, the entropy changes by S, — S; = c, ln a i.e., it also 
increases. si 


Finally, upon expansion into an evacuated vessel at constant 
temperature, the increase in entropy, Sz — S, = uR mee , is again 
1 


positive. Thus, in all unidirectional processes, the entropy of the 
system increases. 

It is easily seen that this conclusion regarding all irreversible 
processes is of great importance. Since each irreversible process is 
accompanied by unidirectional effects serving to increase the entro- 
py, the increase in entropy in an irreversible process is greater than 
the increase that would have occurred if the process were reversible. 
Let AQ be the heat absorbed by a body at temperature T in an irre- | 
versible process. If the process were reversible, the increase in entro- 


py would equal = In an actual process, however, the increase in 
entropy is greater than this value: 


ds > 52, 


If the system is thermally insulated, then AQ = 0 and the above 
expression assumes the form dS > 0, i.e., in a thermally insulated 
system, only processes serving to increase the entropy are possible. 

It is quite clear that entropy and internal energy are the most 
important functions determining a thermodynamic process. Thus, 


64. The Principle of Operation of a Heat Engine 181 


if entropy is analogous to the manager of a process, internal energy 
is analogous to the bookkeeper. While entropy determines the direc- 
tion of flow of the process, the energy “meets the expenditures” of 
conducting it. 

If in the above formula the symbol > is used instead of the sym- 
bol >, the law of entropy for reversible and irreversible processes 


may be described by the following simple formula: dS sie . This 
formula expresses the essence of the second law of thermodynamics. 
For closed systems, the second law states that the entropy of a ther- 


mally isolated system increases or remains the same. 
Both laws of thermodynamics may be combined in the single 


formula dS > = which is applicable to all practical ther- 


> 


modynamic problems. 


_ Examples. 1. In the example on p. 68, we considered the nonelastic colli- 
sion of a bullet with a ballistic pendulum and showed that, upon collision, 
399.6 kg-m of mechanical energy are dissipated in the bullet-pendulum system. 
This means that the bullet irreversibly transfers AQ = 935 cal to the pendulum 
through heat conduction. If it is assumed that the process is isothermal (i.e., 
the thermal conductivity of the pendulum is extraordinarily high), and that 
the temperature is, say, 27°C, then the entropy of the system in this irre 


versible process will increase by - 
as=A@ 23.12 cal /deg. 


t 2. A rubber ball weighing 0.3 kg rises 1 metre off the floor after being dropped 
rom a height of 2 metres. In this isothermal process (assume t = 27°C), 
We transfer AQ = 0.3 kgm irreversibly, i.e., the entropy of the ball-floor 


System increases by 
AS = 2.35 X 10-8 cal/deg. 
i If the ball and floor were absolutely clastic, the entropy would not have 
changed (AS = 0) and the motion of the pall would have continued eternally. 
jn the transfer of heat 


3. Let us consider the irreversible process involved tran: 
Assume that the steam poiler is at a tem- 


mperature tz = 30°C. For a boiler 
thermal capacity of 10,000 kW and an efficiency of 25 per cent, 7.5 X 10° joules 
aay transferred from the boi, jes y second. Since 
ve boiler lo: its AQ will be negative, 1-8., 

or the ASI sthe increases. However, since 
| >To, the entropy of the boiler-condenser system W 


“ AS=A0 (4-7) — 9,81 x 103 cal /deg. 
2 1 
64. The Principle of Operation of a Heat Engine 
In other words, it takes 


A heat engine converts heat into work. In í 
heat from some bodies and transfers it to others in the form of mechan- 


‘cal work. In order to accomplish this conversion, we must have 
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at our disposal two bodies at different temperatures, between which 
heat exchange is possible. The hotter body will be designated as the 
hot body and the colder one as the cold body. In the presence of two 
such bodies, the process of conversion of heat into work may be 
described as follows: A substance capable of expanding (the working 
substance) is brought into contact with the hot body. Heat Q, is 
taken from the hot body and is expended on the work of expansion, 
A, which is transmitted to surrounding bodies. The working sub- 
stance is then brought into contact with the cold body and transfers 
heat Q, to it at the expense of the work A, performed on the working 
substance by the external forces. s 

To obtain a continuously operating heat engine, the compression 
process must be concluded where the expansion process began. In 
other words, the overall process must be cyclic. The working sub- 
stance returns to its initial state at the end of each cycle. Hence, 
the law of conservation of energy requires that the energy obtained 
from the surrounding bodies equal the energy transferred to the sur- 
rounding bodies. The working substance obtained the heat Q, dur- 
ing expansion and the work A, during compression. On the other 
hand, it gave up the work A, during expansion and the heat Q, 
during compression. Hence, Q, -+ A,=Q2+ A; or Ay — Ay = 
= Q, — Qa. When the cyclic process is conducted clockwise, the 
work of compression is less than the work of expansion. Therefore, 
the last equation expresses the simple fact that the network trans- 
mitted to a working substance by an external medium is equal to 
the difference in the heat absorbed from a hot body and given up 
to a cold body. Accordingly, the efficiency of the cycle and, hence, 
of the engine as a whole is s 


Qo 


n=1-—. 


The described process for the operation of a heat engine is, natu- 
rally, an abstract scheme. However, the essential features of every 
heat engine are incorporated in this scheme. An expanding and con- 
tracting gas or steam is the working substance, the surrounding 
medium plays the role of cold body, and a steam boiler, or a fuel 
mixture in internal combustion engines, serves as the hot body. 

A refrigerating engine, in which the cycle is reversed, requires 
the same three system components. The principle of operation of 
this engine consists in the following: Expansion of the working sub- 
stance occurs when it is in contact with the cold body. Thus, the 
cold body is cooled even further, which is precisely the task of the 
refrigerating engine. Now, in order to complete the cycle, the work- 
ing substance must be compressed and the heat given up by the 
cold body rejected. This is accomplished when the working sub- 
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stance is in contact with the hot body. Thus, the hot body becomes 
even hotter. The “unnatural” transfer of heat from a cooler body to 
a hotter body is at the “expense” of work. We see, then, that when 
the cycle is conducted counterclockwise, the relationship between the 
energy transferred to a medium and the energy absorbed from 
a medium, i.e., Qi + 42 = Qo + At or Q2 — Qı = — (A; — Ad), 
where as before the subscript 1 refers to the*portion of the process 
occurring when in contact with the hotter body, has the following 
meaning: The quantity of heat removed from a system must be 
compensated for by an equal quantity of mechanical work. 

The second law of thermodynamics imposes certain conditions on 
the operation of a heat engine. If a process is assumed to be revers- 
ible, the change in entropy of the working substance for the entire 
cycle should equal zero. Stated otherwise, the change in entropy for 
the expansion process must equal (except for reversed sign) the 
change in entropy for the contraction process, i.e., 


In the case of an irreversible process, the entropy of the closed sys- 
tem, consisting of the hot body, the cold body and the working 


substance, increases and, therefore, 
dQ dQ 
Seth ao 


(It should be recalled that Q is an algebraic quantity. Thus, heat 
entering the system is considered positive.) By evaluating these 
it 1 it is rather simple in a number of 


integrals for specific processes, a 
cases to determine the maximum efficiency of one or another heat 


engine cycle. 


65. Efficiency of a Carnot Cycle 


We shall now derive the expression for the efficiency of an ideal 
heat engine operating without, losses In a reversible cycle. 

Let us first consider the theoretical four-stroke Carnot cycle rep- 
resented in Fig. 81. The Carnot cycle consists of two isothermals 
(for temperatures T4 and Ts) and two adiabatics. Assume that the 
first strokes of the cycle is an isothermal expansion from state Z to 
state 2—the working substance is in contact with the hot body 
whose temperature is T, and the process takes place very slowly. 
When state 2 is reached, contact with the hot body is broken, the 
Working substance is thermally isolated and it has the possibility 


of expanding further. Work occurs at ti : 
eneney ait the temperature of the working substance is allowed to 
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drop to T. From this point (state 3), two-stroke contraction begins. 
The working substance comes in contact with the cold body at tem- 
perature T, and isothermally contracts to state 4. Here, the working 
substance is again thermally isolated and the contraction continues, 
now adiabatically, with the work- 
ing substance being heated, at 
the expense of performed work, 
to the initial temperature T4. 
The adiabatic processes in a 
Carnot cycle are of an auxiliary 
nature, enabling us to transfer 
from one isothermal to another. 
These processes do not enter into 
the energy balance, since c, (Z — 
— T»), the work of adiabatic 
expansion, and c, (Ta — 7), the 
work of compression, cancel each 
other. 
Fig. 84 In an adiabatic process, the ent- 
ropy of a system does not change. 
During isothermal expansion, the entropy of the hot body decreases 


by u and the entropy of the cold body increases by a - The working 
1 


substance returns to its initial state with its entropy unchanged. 
If the process is reversible, then paz . For irreversible processes, 
1 


the entropy of the entire system, consisting of the cold body, the 
hot body and the working substance, increases, i.e., the entropy 


increment oe is greater than the decrement a : 
2 1 


[Q21 1Q1 | 
OT ae os 


Thus, A oe and, therefore, the efficiency of a Carnot cycle is 
1 
T. 


Nmax = TR 


The efficiency of the cycle is determined by the temperatures 
of the cold and hot bodies, respectively. The greater the drop in 
temperature the greater the efficiency of the engine. It is not dif- 
ficult to see that the efficiency of a Carnot cycle is the maximum 
efficiency possible. There is no 
and, in this sense, it serves as a 
They strive to make actual cyc 
engine. 


model for designers of heat engines. 
les approach the cycle of this ideal 


cycle better than the Carnot cycle ` 
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It is not difficult to prove that the efficiency of a Carnot cycle is the opti- 
mum. Fig. 82 shows an arbitrary cycle inscribed in a Carnot cycle. The de- 


crease in the entropy of the hot body may be repre- 
sented by the integral 


B o 
JT: 


for which the inequality 


ae ae 
Irr art 


J 
A 


Fig. 82 


is undoubtedly valid, since 7 is the largest value 
assumed by 7 in the integration. The increase ` : 
in the entropy of the sold body is . expressed by the integral’ 


aQ 
\ Se 
B 
for which the inequality 
A 
dQ Pi =Q 
\>< Tz g T2 


is valid, since 7% is the smallest value assumed by 7 in the, integration. For 


a reversible process, 


Therefore, ah 
KARTET 
Ta at 
which yields the condition: F 
2 
Nmax = de TE i 


Thus, the Carnot cycle has the maximum efficiency of all possible 


cycles. a 
This maximum efficiency formula shows why steam engines have 


low efficiency. At T2 = 300° and T, = 400°, the efficiency is 25 per 


is i i ffici 
cent. Moreover, this is the maximum e i 
ideal reversible engine operating without any losses in energy. It 
is, therefore, not sup aE that i 
oyi 10 per cent. Courses 1 i i 
used a ae the efficiency. Clearly, the most important method 
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is to increase the temperature of the hot body, i.e., the steam or 
fuel mixture. 


66. The Second Law of Thermodynamics 


As indicated above, the second law of thermodynamics states 
that the entropy in a thermally isolated system increases. This state- 
ment may appear to be somewhat abstract, but it is not the form 
in which this idea was expressed historically. In view of the tremen- 
dous importance of this law of nature, we shall briefly discuss other 
important formulations of the second law of thermodynamics and 
show that they are equivalent to the above. 

Historically, the second law of thermodynamics was expressed 
in the form of Thomson’s postulate on the impossibility of creating 
a perpetual engine of the second kind. A perpetual engine of the first 
kind creates work “out of nothing”, i.e., its work violates the first 
law of thermodynamics. A perpetual engine of the second kind 
produces work by means of a periodically operating engine merely 
by absorbing heat from the surrounding medium. If such an engine 
were possible, it would be practically eternal, for the supply of ener- 
gy in the surrounding medium is almost limitless and the cooling, 
say, of the oceans’ waters by one degree would yield an inconceiva- 
bly large amount of energy. The mass of water on the Earth is of 
the order of ~10'8 tons. If this entire mass of water were cooled by 
only 4°, the heat released would be about 1024 kcal, which is equiva- 
lent to the complete combustion of 10" tons of coal. Rolling-stock 
loaded with this quantity of coal would extend for a distance of 
~10" km, which is the order of magnitude of the dimensions of the 
solar system! 

A perpetual engine of the second kind is a heat engine working 
with a hot body, but without a cold body. If such an engine were 
possible, it could work on a single stroke. A gas contained in a cyl- 
inder with a piston could indeed expand, but the operation of the 
engine would end there, since for the engine to continue operating, 
the heat absorbed by the gas must be transferred to a cold body. 
Formally, the formula for maximum efficiency shows that a per- 


petual engine of the second kind is impossible. In the absence of 
a temperature drop (T, = T,), the maximum efficiency is equal to 
zero. 


It is impossible to design a periodically operating perpetual engine 
by combining an isothermal expansion with an adiabatic com- 
pression process. Such a process would not be possible even if we 
could make it reversible. For isothermal expansion of the working 
substance, the entropy decreases. Hence, the compression process 


would have to yield an increase in entropy. This, however, is not 
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possible for an adiabatic process, since it proceeds at constant 
entropy. 

The postulate of Clausius also completely corresponds to the 
formulation adopted here for the second law of thermodynamics. 
It states that heat cannot be transferred from a colder body to 
a hotter body without compensation. A process contradicting the 
postulate of Clausius would take place with a decrease in entropy. 
At the very beginning of our discussion, this was shown to be impos- 
sible. 

We shall again return to the second law of thermodynamics in 
Sec. 77, where it will be discussed from the standpoint of the kinetic 


molecular theory. 


CHAPTER XII 


KINETIC THEORY OF GASES 


67. General 


If the molecules of a solid body are assumed to be contiguous, we 
can accurately determine their dimensions by X-ray analysis (p. 385). 
Then by comparing these dimensions with the space available to 
a molecule in a gas, the fundamental properties of the gaseous state 
of matter may be immediately determined. 

The largest linear dimensions of a diatomic molecule of oxygen 


is about 4 Å. Nitrogen molecules have approximately the same 
dimension, but molecules of hydrogen are considerably smaller. The 
volume of an oxygen molecule is about 10-2? cm?. Since under normal 
conditions there are 2.7 X 101° molecules in 4 cm? of oxygen, the 
space available to a molecule is about 0.4 x 40-1 em3. Comparison 
of the volume of a molecule with the space available to it shows how 
little of the space is occupied by molecules. It is-evident that for 
such a low density collisions between molecules will be rel 
rare. On the average, the length of the path traversed by a m 
between consecutive collisions is 1,000 A. However, the velocity 
of a molecule is large, about 500 m/sec, so that on the average 
a collision occurs every ten-thousand millionth (10-20) of a second, 
In will be shown below how these figures were obtained. 

Molecules begin to draw together only when the distances between 
them become comparable to their own dimensions. Therefore, for 
a large part of their path, molecules move rectilinearly and uniform- 
ly. Only when one molecule comes within range of another does the 
force of interaction become effective. Since the interaction occurs 
over an insignificantly small portion of the path, we can speak of 
a collision between the molecules. The interval of time during which 
molecules perceptibly interact—in other words, the impact time—is 
equal to about 10-* sec. Thus, a molecule spends by far the greatest 
part of its “life” in free motion subject to inertia. 

This is the situation for gases under normal conditions. An in- 
crease in pressure, leading to an increase in density, may consider- 
ably alter the picture. 

The internal energy of gases in which interaction between mole- 
cules occurs only for the time of instantaneous collision does not. 
contain potential energy of interaction between molecules. Such 
gases are called ideal. The use of one and the same term a second 


atively 
olecule 
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time will be shown to be justified by demonstrating the validity of 
the equation of the gas state for such gases. 

Thus, a gaseous substance consists of a tremendous number of 
minute particles that pass through large spaces without colliding, 
then collide like billiard balls and fly apart in different directions 
with different velocities, until the next collision. If we were to trace 
the path of a single gas molecule (naturally, this can be done only 
mentally), we would find it moving now to the left or to the right, 
now forward or backward. Sometimes it would be moving with 
a large velocity and at other times it would be moving slowly. In 
view of the chaotic nature of thermal motion in a gas, the molecules 
of a free gas in thermal equilibrium may be considered to have uni- 
form density distribution throughout its volume. Furthermore, at 
a given instant, there will undoubtedly be equal quantities of mole- 
cules moving in all directions. Other random events will similarly be 
uniformly distributed. For example, at all locations, equal numbers 
of molecules per second of observation will be travelling without 
collision a distance of 100 A to 200 A. 

It must be realised, however, that these statements are of a statis- 
tical character. They are valid on the average, whereby the greater 
the number of gas molecniles involved the greater the validity. 

We assert, for example, that the number of molecules moving 
“to the right” is the same as the number moving “to the left”. Natu- 
rally, this does not mean that the numbers are equal to within sev- 
eral units. The number of molecules involved is so large that not 
only is a difference of several units insignificant, but even a differ- 
ence of several million is negligible percentagewise. 

If numerous measurements are taken of the gas density of a given 
volume, the values obtained for the number of molecules will differ 
somewhat from measurement to measurement. From these data, we 
can determine the average value for the number of molecules in the 
volume under consideration. If it were possible to measure within 
an accuracy of even several thousand molecules, the individual 
measurements would oscillate, percentagewise, to an insignificant 
extent about this average value. 

When it is stated that a number of molecules have such and such 
a velocity, or move in some direction or other, or collide in accord- 
ance with some mechanism or other, then the average value of the 
number is always meant. If the number of gas molecules is large, 
the deviations of the instantaneous values from the average, 1.e., 
the fluctuations, are negligible. In a very rarefied gas, however, the 
fluctuations may be considerable. e 

It is shown in the theory of probability that, using absolute val- 
ues, the average relative deviation of the gas density from the average 
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is approximately equal to = » Where z is the number of molecules 
n 


in a unit volume. Since there are 2.7 x 401° molecules in 1 cm? 


of gas, the fluctuation of the gas density within one cubic centimetre 
amounts to 


: 1 
V2.7 x 1019 


i.e., 2 X 10-10 from the average value. It is evident that such devia- 
tions are beyond experimental observation, 

This is how matters stand with respect to all gas properties that 
are determined by the average number of molecules. 

The origin of the kinetic theory of gases dates back to Daniel Ber- 
noulli (1700-1788). M. V. Lomonosoy (1711-1765) also made sub- 
stantial contributions to its development. In the 49th century, 
the kinetic theory of gases developed under Clausius (1822-1888), 


Maxwell (1831-1879) and Ludwig Boltzmann (1844-1906) and as- 
sumed its modern form. 


68. Mean Free Path 


The distance traversed by a molecule between two consecutive 
collisions (the range of a molecule) is, naturally, a random quantity 
that may sometimes be very small or very large for individual mole- 
cules. However, in view of the chaotic nature of the particle motion, 
the average value of this quantity for a given gas state is undoubted- 
ly constant. The mean free path or, for brevity, the range Lis relat- 
ed to the average velocity v of the molecular motion and the average 
time t between two collisions by the simple relation: 1 = vt*, Typ- 
ical values for these quantities were cited on p. 188. 

The range of a molecule depends, in the first place, on the number 
of molecules in a unit volume of gas. Moreover, it is evident that the 
larger the dimensions of the molecule the smaller the mean free path. 

In order to visualise the character of this relationship, let us con- 
sider a cylindrical volume of gas through which a molecule moves 
along the cylinder axis. What is the path taken by the molecule? 

Molecules are not points. They have dimensions determined by 
the distances for which molecular interaction becomes effective. 

On the basis of crystallochemical measurements (see p. 385), we 
may, with considerable accuracy, ascribe a certain form to molecules. - 


* Since we are merely concerned with the determination of the connection 
between the physical quantities and not wi 


formulas, we shall not differenti cae th the determination of the exact 
RA not differentiate between É 3 
velocities (see below), average and root-mean-square 
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At distances extending beyond the limits of the molecule’s “bound- 
ary”, the forces of interaction are, practically, not effective. 

Let us project the maximum cross-section of the molecules on to 
the base of the cylinder. Each molecule will be projected differently. 
Since there are many molecules, the average cross-sectional area 
will characterise a molecule with sufficient accuracy. This average 
area of cross-section o is called the effective cross-section. 

A collision will surely occur along the length of the cylinder if 
the area of the cylinder base is completely filled with the cross- 
sections of the molecules. If the cylinder base is equal to 4 cm?, 
cylinder length equal to Z, and the number of molecules per unit 
volume equal to nz, then there will be a total of nl molecules in the 
cylinder. The projections of the cross-sections of these molecules 
will completely cover the cylinder base when nlo = 1. Under these 
conditions, the value of 2 will have an order of magnitude that is 


x 1 2 
close to the average range of the molecule, i.e., Lœ zz- More rigorous 


calculations confirm the validity of this rough estimation. In the 
exact formula, the factor |/2 enters in the denominator: 


c 1 
k2ng- 


where o has a constant magnitude for a given gas. Thus, the mean 
free path is determined only by the density. A decrease in density 
by a factor of 100, for example, results in an increase in the mean 


free path by the same factor. y j 
For air under normal conditions, the effective cross-section o is 


approximately equal to 5 x 14075 cm*. This is in excellent agree- 
ment with the dimensions of oxygen and nitrogen molecules obtained 
from crystal measurements. The maximum dimension is equal to 
4.3 Å and the minimum is a little less, namely,, 3 A. The radius of 
a circle having an area of 5 X 10715 cm? is 4 A. f 

We can determine the dimensions of molecules by studying crys- 
tals. However, the investigation of particle collisions may be viewed 
as a method of establishing the effective cross-section of particles. 
This method is of value in studying atomic nuclei (p. 554). 


ath under normal conditions is: in air—600 A, in nitro- 


Th free p ; 
70S. : d in helium—1,800 A. 


gen—600 A, in hydrogen—1,100 A an 


69. Gas Pressure. Root-Mean-Square Velocity of Molecules 


Let us consider the problem of using the’ simplified concepts 
regarding the motion and interaction of gas molecules to express the 
gas pressure in terms of the quantities characterising the molecule. 
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Assume that we have a gas enclosed in a spherical tank of radius 
R and volume v. Disregarding collisions between gas molecules, we 
may adopt the following simple scheme for the motion of each mole-’ 
cule: A,molecule moves rectilinearly and uniformly with some veloc- 
= ity v, strikes the wall of the vessel 
and rebounds at an angle equal to 
the angle of incidence (Fig. 83). Tra- 
versing chords of equal length, 2R 
sin 0, time after time, the molecule 
strikes the wall of the vessel san 
times per second. For each impact, 
the momentum of the molecule 
changes by 2 mv sin 0 (see p. 68). 
The change in momentum per sec- 
ond is equal to T. 
We see that the angle: of inci- 
p dence cancels out. If the molecule 
Bg Ee strikes the wall at an acute angle, 


the impacts will occur often, but 
will be weak. If the angle of incidence is close to 90°, the molecule 


will strike the wall less often, but will make up for it by stronger 
impacts. 

The change in momentum for each impact of the molecule on the 
wall contributes to the overall force of the gas pressure. It may be 
assumed in accordance with the fundamental law of mechanics that 
the force of the pressure is simply the change occurring in the momen- 


tum of all the molecules in one second: ate -.. or, factoring 


out the constants, 
m 
Rtt...) 
Assuming n molecules are contained in the gas, we may introduce 


the concept of the average of the velocity squared of a molecule, 
which is determined by the formula 


T= ioo anhe 


The expression for the force of the pressure may now briefly be 
written as follows: A 


mnv? 
1 ERA 
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Dividing this expression by 40R?, the surface area of a sphere, we 
oblain the gas pressure: 


nmv? 
D 4nR3 ~ 


Replacing 4c by 3V, the following interesting formula is obtained: 
af 2 v2 
pV =4 nm or pV =an (=) A 


Thus, the gas pressure is proportional to the number of gas mol- 
ecules and to the average value of the kinetic energy associated with 
the translatory motion of a gas molecule. ‘ 

A very important conclusion may be drawn by comparing the 
obtained equation with the equation of the gas state. Comparison of 
the right-hand members of the equations shows that 


p mo? \ mv2 3 uR 
pata=on(7-) or S-=a 4 


i.e., the average kinetic energy of molecular translation depends 
only on the absolute temperature and, moreover, is directly pro- 
portional to it. i 
This conclusion shows that gases obeying the ome onia the gas 
state are ideal in the sense that they approximate the i eal mo e 
of a group of particles whose interaction 1s insignificant. Ate 
it shows that the concept of absolute temperature, in tronneea ae 
ically as a quantity proportional to the pressure of a rarefied gas, 
has a simple kinetic-molecular interpretation. The absolute temper- 
ature is proportional to the kinetic energy of molecular translation. 
The ratio 2 = NV is known as Avogadro’s number. It is the number 
t ; Pa x 
of molecules in one gram molecule and is a universal constant: 


seg! oe 
N = 6.02 x 102%, The reciprocal quantity >, is equal to the mass of 
a hydrogen atom: 


My == 1.66 x 10 gm. 


Another universal constant is the quantity 


Fie LEOA £ — 1.38 x 104% erg/deg, 


n 


m2 _ 3y 
z =74T 


13—1409 
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If the velocity squared, v?, is represented by the sum of the squares 
of its components, v? = vi + vh + vi, it is evident that the average 
energy for each component is 


1 > 
x ht. 


This quantity may be described as the energy associated with one 
degree of freedom. 

The universal gas constant is accurately known from experiments 
with gases. The determination of Avogadro’s number and Boltz- 
mann’s constant, which are expressed in terms of each other, is a 
relatively difficult task involving delicate measurements. 

These results put at our disposal useful formulas for calculating 
the average molecular velocity and the number of molecules in 
a unit volume. 

Thus, for the average of the velocity squared, we obtain 


=a SRM SRP 


mN  M? 
where M is the molecular weight. The square root of the average of 
the velocity squared is called the root-mean-square velocity: 


Vime=YV oe Or Vrms = pea , 
i.e., the r-m-s velocity is directly proportional to the square root of 
the temperature and inversely proportional to the square root of 
the molecular weight. It is easily determined that at room tempera- 
ture oxygen molecules have a velocity of 480 m/sec and hydrogen 
molecules—1,900 m/sec. At the temperature of liquid helium, these 
molecules would have, respectively, velocities of 40 m/sec and 
160 m/sec, while at the temperature of the surface of the Sun, namely 
6,000°, these velocities would be 2,160 m/sec and 8,640 m/sec, 
respectively. These examples are unrealistic, however, for, at the 
temperature of liquid helium, oxygen and hydrogen solidify and no 
translatory motion of the molecules will occur, while at the temper- 
ature of the surface of the Sun the molecules disassociate into atoms. 


We obtain the following simple expression for the number of 
molecules in a unit volume: 


DATE 3p P 


Ki 


Avogadro's law follows from this and may be stated as follows: 
For equal pressures and temperatures, all gases contain the same 
number of molecules per unit volume. Thus, under normal conditi- 
ons (at a pressure of 1 atm and a temperature of 0°C), 2.683 x 101° 
molecules (Loschmidt s number) are contained in 4 cm’. 
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70. Internal Energy of a Gas 


The properties of monatomic gases are determined by the kinetic 
energy of translation of the molecules. An atom’s internal energy 
does not affect the thermodynamics of the gas. Evidently, the inter- 
nal energy need be considered only when the temperature of the gas 
is very high and collisions between atoms may lead to their excita- 
tion and ionisation. These processes will be discussed in detail 
later on. 

Thus, the following formula for the internal energy of a monatom- 
ic gas will have very broad application: 


where N is the number of molecules. Using the formulas of the pre- 
vious article, we obtain for 1 mole of an ideal monatomic gas the 


expression 
U=ŻRT. 


Hence, for the thermal capacity of 1 mole of a monatomic gas, we 
obtain by means of the formulas of Sec.‘ 60: 


=FR 
and 
p= R. 


_ The direct proportionality between the temperature and the 
internal energy, and hence the constancy of the thermal capacities 
of a monatomic gas, are valid for quite a broad interval of external 
conditions. [Xe 
_ For polyatomic gases, such a simple picture is valid for a signif- 
icantly narrower interval of temperatures, if valid at all. The reason 
for this is that the energy of a polyatomic molecule consists of the 
energy of translation, the energy of rotation and the energy of vibra- 
tion of the molecule’s components (i. e., the molecule’s atoms) with 
respect to each other. Calculation of the average energy per molecule 
becomes quite difficult. It turns out that the energy of a molecule 
1s no longer linearly dependent on the temperature and, hence, 
that the thermal capacities are no longer constants independent 
of the magnitude of the temperature. Nevertheless, it is usually pos- 
Sible to find a narrow interval of temperatures in which the thermal 
Capacities do not depend of the temperature. This occurs for such 
Values of temperature at which the average energy of the molecule 
is not yet sufficient for the collisions of the molecule to lead to 
13* 
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a change in its vibratory state. At the same time, this energy is suf- 
ficiently large so that the discrete (quantum) character of the energy 
of rotation is not felt. Jumping ahead and referring the reader to 
Fig. 266 (p. 606), it may be stated that linear dependence between 
energy and temperature, and constancy of thermal capacity, will 
occur when the quantity T, descriptive of the order of magnitude 
of the translational energy of the molecule, is considerably greater 
than the distances between rotational energy levels and less than the 
distances between vibrational energy levels. 

If such an interval exists, the energy of a mole of gas and the 
thermal capacities of this quantity of gas are expressed by the fol- 
lowing simple formulas: 


GEIRI 
Cooney 
Cp=4R. 


The doubling of the internal energy and c, with respect to a mona- 
tomic gas may be explained in the following manner. A polyatomic 
molecule has six degrees of freedom, while a monatomic molecule 
has three. Since there are twice as many degrees of freedom, there is 
twice as much internal energy. To be sure, there is nothing self-evi- 
dent about this statement. However, we find support for this view- 
point when we consider a gas consisting of diatomic molecules. 
Since a diatomic molecule is a system consisting of two particles, 

it possesses five degrees 

aR or of freedom (see p. 44). If 
oF the internal energy is 
indeed proportional to the 
number of degrees of 
freedom, then for a gas 
consisting of diatomic mol- 
ecules the following for- 
mulas should be valid: 


5 5 
U= 7 Rf, => 


T°K 7 
and ¢p= > R. 


Fig. 84 


Experiments show that 
in the temperature range 
in which the thermal capacity remains unchanged these formulas 
are quite applicable. The internal energy of one mole of a diatomic 
gas at a room temperature of 300°K is 1,500 cal = 6,250 joules. 

A typical dependence curve for thermal capacity over a broad 
interval of temperatures is illustrated in Fig. 84. 
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71. Statistical Distribution 


Numerous events occur that cannot be predicted. These are called 
random events. The height of a young man appearing for military 
service, the number of pedestrians passing a particular crossing 
during certain hours, and the number of winning tickets in a loan 
lottery falling on each series of one hundred bond numbers are all 
examples of random events. The results obtained by observing numer- 
ous events of a single type, for example, measuring the height 
of a large number of young people, counting the number of pedestri- 
ans per minute over a large number of days, or analysing the number 
of winning tickets for a large 
number of loan lotteries, 
may be summarised in the 
form of a so-called distri- 
bution curve. In the case of 
the height of a person, the 
data may be processed in 
the form of numbers indi- 
cating the number of men 
called up for military serv- 
ice whose heights are be- 
tween 170 cm and 171 cm, 
between 171 cm and 172 cm, 
etc. Thus, the probability | 
of observing a person among F 
those called up who has pre- Higa ep 
cisely a given height (e. g., we 
171.34 cm) is practically equal to zero. Therefore, it is better to 
refer only to the number of men called up haying a height lying in 
a particular interval. . Tepes: 

Inthe case of the analysis of the prize list, the distribution curve 
may be constructed on the basis of the data for the number of series 
of one hundred bonds for which there was not a single winner, for 
Which there was one winner, two winners, etc. : 

If we construct a graph, plotting the random quantity (e. [oe 
height, number of pedestrians or number of winners) along the horizon- 
tal axis and the number of random events (e. g., number of people 
having a height lying in a particular interval, the quantity of cases 
of a given number of winners per one hundred numbers, etc.) along 
the vertical axis, the obtained curve is a distribution curve. An 
example of such a curye is shown in Fig. 85. The curve is drawn 
through the mid-points of the tops of the reclangles. Each rectangle 
has an‘area equal to the number of times a random event occurred 


for the quantity lying in the given interval. 


Number of random events 


Random quantity 
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The remarkable feature of distribution curves is their reproduci- 
bility. If we construct distribution curves analysing the height of 
young men called up for military service for a number of years, it 
will be seen that the curves are entirely similar. This similarity 
will not be found if we study the height distribution curves construct- 
ed on the basis of a small number of measurements. As we increase 
the number of measurements on which each curve is based, the curves 
for different years will become more and more similar. The same 
holds true for the distribution curves of all events, provided the 
events are random and the conditions of the obtained curves do not 
change. 

We call the distribution law of one or another quantity a statis- 
tical law. It is more accurately given the greater the number of events 
used to determine each ordinate of the curve. 

Naturally, knowing the distribution curve does not enable us to 
predict the number of a bond that will win in the next lottery. How- 
ever, we can say, for example, what portion of the series consisting 
of one hundred numbers each will have one winner. The greater the 
number of bonds used in the analysis, the greater the accuracy of 
this prediction. 

In view of the large number of molecules contained in very small 
volumes of matter, all kinds of statistical predictions about the 
behaviour of molecules are made with particularly high accuracy. 
A distribution curve of one or another random quantity plotted for 
the molecules of a substance will be reproduced with tremendous 
accuracy because each “rectangle” of the distribution curve corre- 
sponds to thousands of millions of molecules. 


72. Boltzmann’s Law 


Certain ideas about the distribution of molecules follow immedi- 
ately from the chaotic nature of thermal motion. This applies to 
the velocity distribution of the molecules according to direction and 
to the volume distribution of the molecules for the case when no 
forces act on the gas. However, there are numerous cases when the 
consequences of the assumption regarding the chaotic nature of 
thermal motion are not evident in advance. 

First, there is the question of the distribution of the molecules 
according to speed. What percentage is moving rapidly, and what 
percentages are moving with average or slow speeds? Then, there 
is the problem of determining how a uniform density distribution 
of molecules changes when the gas is placed in a field of force, say 
in a gravitational field—or, if the molecules have electric or magnetic 
properties, in an electric or magnetic field. Boltzmann's law, which 


72. Boltzmanns Law 199 


may be derived by means of the theory of probability, gives the 
answers to these and similar questions. 

Let us consider a small volume of space—a cube at point 2, y, 2 
whose sides are Az, Ay, Az. Assume a considerable number of mole- 
cules to be contained in this cube. We shall consider those molecules 
having velocity components in the ranges from vy to vy + Avy, Vy 
to vy + Av, and vz to vz + Av;. The magnitudes of Avx, Avy and 
Av, are such that a large number of molecules are contained in the 
indicated interval ofj velocities. This is necessary in order to be able 
to apply the laws of statistical physics to these small volumes (phys- 
ically, infinitesimal volumes). In the future, we shall say that such 
molecules have coordinates in the neighbourhood of x, y, z and veloci- 
ties in the neighbourhood of vx, Vy, Vz- We repeat, to speak of a quan- 
tity of molecules that have exactly a given velocity is impermissible, 
for the probability. of encountering such a molecule is infinitely 
small. Since the kinetic energy of a molecule is determined by 
the value of the velocity and the potential energy of a molecule in an 
external field depends on the coordinates of the molecule in space, 
all the molecules segregated by us have, practically, one and the 
same energy 6. 

Boltzmann’s law, based on considerations developed in courses 
on theoretical physics, gives a general expression for the number of 
molecules whose coordinates are in the neighbourhood of z, y, Z 
and velocities in the neighbourhood Of Vy, Vy, Vz: This number is 


An = Ae €/#T Ax AyAzAvxAv,Avz, 


where A is a constant that may be determined for a concrete problem, 
T is the absolute temperature and k is Boltzmann’s constant. 

The energy in the exponent is equal to the sum of the kinetic 
energy of translation of the molecule and the potential energy of 
the molecule in the external field: 


g= +U. Hence, 


meu 
An Her * AaAyAzAvyAvyAvz. 


This formula also applies to the case when the molecule possesses 
other forms of energy as well, for example, rotational and vibration- 
al. These components of the energy must then be included in é. 

Boltzmann’s law or, as it is also called, Boltzmann's distribution, 
shows that the largest energy corresponds to the lowest number of 
particles whose velocities and coordinates lie in the given interval. 

We shall apply Boltzmann's law to the solution of important prob- 
lems related to the height distribution of particles and the velocity 


distribution of molecules. 
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73. Distribution of Particles with Respect to Height 
in a Gravitational Field 


If, in a liquid, there are a large number of small particles that are 
heavier than the liquid and do not dissolye in it, at first glance it 
may appear that sooner or later these particles must fall to the bot- 
tom. This, however, does not occur—but it would if there were no 
thermal motion. 

Thus, the force of gravity attracts the particles downwards, but 
the chaotic thermal motion, an inherent property of all particles, 
will continuously impede the action of the gravitational force. 
A particle moving downwards may experience a collision on the way 
that hurls it back upwards. It again begins to move downwards and 
again a collision may hurl the particle upwards or sidewise. While 
some particle may succeed in reaching the bottom of the vessel, 
another particle, on the other hand, may be raised from the bottom 
by random impacts and brought to the upper layers of the liquid 
by random impulses. It is quite understandable that as a result 
some nonuniform distribution of particles is established. In the 
upper layers there will be the least number of particles, while at 
the bottom of the vessel there will be the greatest number. The 
heavier the particles and the lower the temperature, the more 
will the height distribution of Particles be “compressed toward the 
bottom”. è p 

The quantitative aspect of this interesting ph occur- 
ring for all particles located in a gravitational field (molecules of 
a gas or particles of an emulsion suspended in a gas or liquid) be- 
comes clear from Boltzmann’s law: We may rewrite the exponential 
factor in the formula for the Boltzmann distribution in the form 


mv mgh 


e 2kT¢ 


AT., 

U, the potential energy of gravitation, has been replaced by the 
expression mgh. Now, we are interested in the number of molecules 
(of all velocities) located at a height between % and k + Ah. It is 


mgh 
An=ne *¥T Ah, 


Here, the coefficient of proportion 


a ality no is simply the specific num- 
ber of particles ie ath = 0. Fi 


g. 86 shows how the number of parti- 
cles decreases with increasing height. 


The form of this relation confirms the correctness of the assertion 
made above that the greater the mass of the particles and the lower 
the temperature, the more rapidly does the curve fall. It is also 
evident from the curve that its rate of decrease depends on the gravi- 
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tational acceleration. On different planets, the distribution of par- 
ticles with respect to height will differ. 

According to the above formula, at least a small number of 
molecules exists at every height above the Earth’s surface. This 
means that molecules may recede from the Earth and fly into space, 
for it is not excluded that as a result of random collisions one or 
another molecule will attain a velocity of 14.5 km/sec, which is suf- 
ficient, as we know, to escape 
from the Earth’s gravitational 
pull. It may, therefore, be stat- 
ed that the Earth is gradually 
losing its atmosphere. How- 
ever, calculation of the rate of 
dispersion of the atmosphere 
shows that it is negligible. 
During the entire existence of 
the Earth, an insignificant 
amount of air has been lost. 
The situation is different as 7 
regards the Moon, where the- 
velocity required to overcome Fig. 86 
gravity is ~2 km/sec. Such a 
small velocity is very easily 
attained by molecules and as a result the Moon has no atmosphere. 

The formula giving the number of particles as the function of 
height may be rewritten for the density of a gas or for the pressure 
of a gas. Since the gas pressure is proportional to the number of 
particles in a unit volume, the formula may be written in the form 


mgh 
kT . 


Be 
Th 


P=Poe 
sure at zero level. This formula is called the 


It is used by meteorologists measuring atmos- 
duce the results of their 


Here, po is the pres 
barometric formula. e 
Pheric pressure at high altitudes to re 


Measurements to “sea level”. 


i applicati la 
It is necessary to note yet another important application of the formu 

for the distribution of particles with respect to height, which was used or a 
experimental determination of Avogadro's number by Perrin, sulle Troich 
scientist, In accordance with the conditions of the experiment, Ferit bad i 
somewhat modify the formula for the distribution of molecules yi respect Í o 
height. He studied an emulsion obtained by dissolving gutta gamba e may 
of resin) in water. Using a microscope, an entire mound of spherica pin es 
could be observed in the emulsion. Perrin used a centrifuge Por the guna 
gamba granules according to size. Several months of labour yie oad -30 gm 
of gutta gamba granules having a diameter of 0.74 microns. The ghey o 
gutta gamba is D = 1.195 gm/cm?, i.e., the mass of one grain was equal to 
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7 X 10714 gm. Exact determination of the dimensions of the grains was no easy 
task. Perrin made this determination using three independent methods: 

1) The length of a chain of several dozen contiguous grains was deter- 
mined under a microscope. . ; 

2) The weight of several thousand grains was measured and the dimension 
calculated from the known density of gutta gamba. 

3) Stokes’ formula (see p. 221) was used to determine the dimension from 
observations on the velocity with which a cloud of grains sinks in an emul- 
sion. It was assumed that, in accordance with Archimedes’ principle, a grain 


4 > a 
sinks under the action of the force gyo (D — d) g, where d is the density 


of the liquid and r is the radius of the grain. When the grain sinks uniformly, 
this force is balanced by the force of viscous friction calculated by Stokes’ 
formula. From this condition, it is easy to determine r. 

There was close agreement between the results of all three methods. This 
signified that the effective weight of a microscopic granule floating in a liquid 
may be written in the form mg (1-5) - Recalling that kay , we obtain 


the following barometric formula for an “atmosphere” of gutta gamba grains 
floating in water: 


n>=Noe . 
The experiment reduced to the determination of the ratio of concentrations n 
at equal levels. This was accomplished by focussing the microscope on suffi- 
ciently thin layers of the emulsion and calculating the number of particles 
in the field of vision for equal intervals of time. Perrin changed the viscosity 
of the emulsion by a factor of one hundred and observed that the ratio of con- 
centrations exactly agreed with the barometric formula. Substituting the values 
of no, n, h, m, d, D and T, it was possible to determine N. It turned out that, 
in spite of the large changes in the viscosity of the emulsion and the dimen- 
sions of the grains, N determined in this manner agreed excellently with the 
values predicted by the kinetic molecular theory. Perrin obtained 6 x 1023 << 
<N <7 X 10%, while according to modern data N = 6.02472 X 102%. This 
was highly reliable evidence that the Boltzmann distribution accor 


2 Š ding to 
energy is applicable even to particles having a gram molecule (mass of N A 
ticles) equal to 50,000 tons! 


ate (E 2) 


74. Velocity Distribution of Molecules 


The velocity distribution of molecules, first determined theoreti- 
cally by Maxwell, an outstanding English physicist, may be consid- 
ered to be a consequence of Boltzmann’s law. 


According to Boltzmann’s law, the number of molecules whose 
velocities are in the interval f 


rom vx to vy + Av,, v, to v Av 
and vz to vz + Av, is eae j ni e 
_ m2 
An=Ce 2kT Av,Av,Av,. 
It is implied that we are interested in the velocity distribution in 
a small volume of gas and that the space distr 


My e ibution of the mole- 
cules is taken into account by the constant factor C, which does not 
interest us at the moment. 
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The above formula takes account of the distribution of the mole- 
cules with respect to the magnitudes as well as the directions of the 
velocities. However, we already know the distribution with respect 
to the directions—the number of molecules moving in one or another 
direction must be the same for complete randomness in the molec- 
ular motion. We are interested in the number of molecules having 


a speed from v to v + Av, where 
v= Vv +. 

If we construct a three-dimensional diagram, along whose axes 
Vey Vy) Vz, the projections of the velocities of the molecules, are 
plotted, and consider this space to be divided into infinitely small 
cubes of volume Av,Av,Avz, the data on the velocity distribution 
of molecules may be simply represented as the numbers of molecules 
contained in a cube. Boltzmann’s formula gives us the number of 
molecules for each one of the cubes. However, examining the for- 
mula, we see that the number of molecules is the same for all cubes 
located within a spherical shell of radius v to v + Av, for only the 
absolute value of the velocity enters in the exponential factor of the 
formula. The number of molecules having velocities in the range 
from v tov + Avis proportional to the volume of the spherical shell, 
i. e., 4nv*Av. Thus, if the number of molecules contained in one. 
cube is equal to 


mv? 
Ce 2kT AvxAv,Avz, 
the number of molecules contained in the spherical shell, i. e., 
possessing velocities in the range from v to Av, is represented by the 
formula 
mv? 
An=Ce T 4nvAv. 


i r i > At v = 0 and v = co 
What then is the nature of this dependence? At 3 
the number of molecules is equal to zero. It is evident that the curve 
must have a maximum. Let us determine in the usual manner the 
maximum of the factor preceding Av. Taking the derivative of this 


expression and setting it equal to zero, we obtain 


Hence, the value of the velocity for which the distribution function 
% . 
has a maximum is 
2kT 
C= —-. 


m 
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What can be said about this velocity? Since the number of mole- 
cules having the velocity v are plotted along the ordinate of the 
distribution curve, c is a peculiar boundary. Molecules moving with 
velocities greater or less than c are encountered less often than mole- 
cules of velocity c. 

This velocity is called the most probable. The velocity distribu- 
tion ue for gas molecules (Maxwell distribution) is shown in 
Fig. 87. 


It is interesting to compare the formulas for the most probable 
velocity and the r-m-s velocity: 


ay gn ana aye 
c= m and Vrms = AS 


We see that the r-m-s velocity is greater than the most probable. 

The reason for this is evident from the form of the distribution curve. 

Since the curve extends far 

An to the right, the root-mean- 

AV square velocity is displaced 
in that direction. 

t; Let us cite several figures | 
characterising the velocity 
distribution of gas mole- 
cules. The number of mole- 

TV, y cules with velocities close to 

Oe, the most probable velocity 

c is 1.1 times larger than 

the number of molecules 

with velocities close to the 

root-mean-square value, 1.9 times larger than the number of mol- 

ecules with velocities close to 0.5c, and 5 times larger than the 
number of molecules with velocities close to 2c (see Fig. 87). 


Fig. 87 


75. Measurement of the Velocities of Gas Molecules 


Even though the law of molecular velocity distribution is based 
on exceptionally well-founded theoretical grounds, whose validity 
is confirmed by a large number of physical facts, it is interesting to 
subject the distribution formula to direct experimental verification. 

The velocity of gas molecules can be measured in a volume only 
by indirect means. If a molecule radiates light, the velocity of its 
motion affects the width of the spectral lines (Doppler effect). 

Direct means of measurement require molecular beams. For this 
purpose, a long tube of large diameter is partitioned by two shutters 
having very small apertures. The gas is placed in an end compart- 
ment, whereupon the molecules begin, at first, to penetrate into the 
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middle compartment and will sometimes even reach the compart- 
ment at the other end. Clearly, only those molecules whose vector 
velocities are directed along the axis of the tube when passing through 
the first aperture can traverse the entire length of the tube. Thus, 
a molecular beam is separated out from the gas. The velocities of the 
beam molecules all have the same direction. It is evident, however, 
that due to the random motion of the molecules, the distribution 
with respect to speed will be the same as for the molecules of any 
other direction of motion. 

To measure the velocities of the beam molecules, we can resort to 
an arrangement reminiscent of an apparatus used for measuring the 
velocity of a bullet. Such an apparatus has two cardboard disks 
rigidly fixed on a shaft and rotate about it with a velocity œ. If 
the bullet travels parallel to the axis of rotation, the disks will be 
consecutively pierced at two points displaced in azimuth by an angle 
p with respect to each other. This angle corresponds to the rotation 
of the system while the bullet traversed the distance 1 between the 


disks. The rotation time for angle ọ is equal to z. Hence, the velocity 


of the bullet is 


Since molecules cannot pierce disks, the analogous experiment 
for molecular beams is performed with disks in which slits are cut 
along radii. The angular distance between the slits is equal to ọ. 
Clearly, molecules of velocity v can pass through two slitted rotating 
disks only for a specific angular velocity ©, satisfying the condition 
wee Thus, by varying @, we can filter the molecules according 
aving the same velocities and 


wW Dis 
to their velocities, collect molecules h 


measure their relative quantities. l 
The velocity distribution formulas discussed above, and hence the 


formulas for the r-m-s and most probable ' values of molecular 
velocities, have been verified by numerous experiments. 


76. Probability of a State 


ivided into two equal parts by a partition 
been cut. If there are molecules of gas in 
ferred from one half of the box to the other 
th the walls of the container and 


Let us consider a box d 
in which an aperture has 
the box, they may be trans } 
as the result of random collisions w1 


with each other. be 
t that the molecular motion is completely hap- 


Tn spite of the fac ole 3 
hazard, a method exists for predicting how many molecules will be 
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in the left half of the box and how many in the right half. This meth- 
od is based on the application of the theory of probability. 

If there were one molecule in the box, the chances or, as we say, 
the probability, that the molecule is in the right-hand portion is 
the same as that it is in the left-hand portion. Since, in all, there are 
two possible cases (the molecule is either in the left-hand or in the 
right-hand portion), and we are interested in the realisation of one 
of these cases, the probability of the molecule being in one half of 


the box is said to be equal to A . Now, assume that there are two mol- 


ecules in the box, designated by the figures 4 and 2. In all, there are 
now four possible dispositions—both molecules on the left, both on 
the right, molecule No. 4 on the left and No. 2 on the right and, 
finally, No. 2 on the left and No. 4 on the right. We are interested 
in the probability of finding two molecules on the left. This is one 


case out of four possible ones, so that the probability is equal to a A 
i. e., (z) For three molecules, the situation is as follows: 


Nea AO E e A 2 1 
enO EDE Ses a O48 2,3 


1 
38 r 
a yS Fy : Fes E 
i. es (>) - It is not difficult to see that, for the case of M molecules, 


the probability of all the molecules being in one part of the box is 
equal to (z)"- Whenever another molecule is‘added, it is always 
possible to place it either in the left-hand or in the right-hand part. 
Therefore, with each newly added molecule, the probability of the 
molecules being in one half of the container is obtained by dividing 
the preceding probability by two. 

When the number of molecules is still no more than one hundred, 


È 1AN. , 
the quantity (=) is already so small that we need no longer take 


account of the possibility that all the molecules will be located in 
one half of the container. However, the number of molecules in 
a cubic centimetre of gas is not one hundred, but about 102°. If the 
container is considered to be divided into two parts, the probability 
that the molecules will all turn up in one half of the vessel is equal 
4 \ 10 $ i 
to (z) . By taking the logarithm, this number may be converted 
into the form 10-31". To put this number in decimal form, 
3 X 10" zeros must be written! A person writing at a rapid speed of 
threé zeros per second will require 101° sec to write this number. 


Clearly, the probability of all three molecules being on the left is 
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This is equivalent to 300,000 million years, which is ten times the 
amount of time our solar system has been in existence. 

Let us return to the table for the disposition of three molecules. 
Only for one disposition out of eight do all the molecules turn up on 
the left. Every other disposition is also encountered one time out of 
eight. It should be remembered, however, that the molecules are 
arbitrarily numbered and there is no way of differentiating a dispo- 
sition in which Nos. 4 and 2 are on the left from one in which 2 and 
3, or 4 and 3, are on the left. Thus, compared to the one disposition 
in which there are three molecules on the left, there are three dispo- 
sitions in which there are two molecules and the same number of 
dispositions in which there is one molecule on the left. Therefore, 
the probability of some characteristic distribution existing, regard- 
less which particular molecules produce it, may be measured by the 
number of dispositions that can produce the distribution. The great- 
er this number, the more frequently will such a distribution be 
encountered, i. e., the more probable will this distribution be. 

The smallest piece of matter consists of a tremendous number of 
molecules. In their haphazard motion, the molecules are endlessly 
changing their location and velocity. However, if the substance is- 
in thermal equilibrium with the surroundings, the measurable 
Macroscopic characteristics of the substance (i. e., the pressure, 
temperature or energy, etc.) remain unchanged. One and the same 
state is achieved at every instant in another manner, i. e., the mole- 
cules are located differently (with respect to the adopted numera- 
tion of the molecules) and the velocities are different each time 
(again with respect to the numeration). One and the same state is- 
achieved by means of a tremendous number of dispositions of the 
numbered molecules, but the space and velocity distribution of the 
number of particles remains unchanged. ‘ 

The number of dispositions of the molecules leading to one and 
the same space and velocity distribution and, hence, to one and the 
same state of a substance relative to the total number of disposi- 
tions of the molecules is called the probability of the state. ¢ 

If there are only several molecules in a volume under considera- 
tion, the state of this system will continuously change. Not only 


will there be probable states, but sometimes less probable stalin a 
also be encountered. Thus, in the case of three molecules, all the 
Molecules will be in one part of the vessel, on the average, in one 
Case out of eight. However, as the number of molecules anne 
the “weight” of the most probable state also increases. The pro a = 
ity of achieving the more probable states becomes es ly 
Breater than the probabilities of other states. To make this clear, let. 

n a box, but assume that 


us return to the example of the molecules i i 
the box is divided Tk a thousand million cells. What is the proba- 
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bility that NV gas molecules are distributed in all the cells except 
one. Using the same reasoning as previously, it is not difficult to 
conclude that this probability is equal to (0.999999999)*. At first 
glance, it may appear that this will yield a value close to unity. 
However, taking N = 10%, the approximate number of gas mole- 
cules in one cubic centimetre, we find that the probability is equal 
to 410-2%1010, 

This calculation shows that the probabilities of the most probable 
states of a substance and the probabilities of improbable states are 
entirely incommensurable quantities. That is why a state of thermal 
equilibrium is macroscopically stable and the parameters of a sub- 
stance in a state of equilibrium do not change with time. 


77. Irreversible Processes from the Molecular Viewpoint 


The example of the previous article shows quite clearly that the 
state in which the molecules are distributed “uniformly” is the most 
probable. Any deviation from “uniformity’—displacement of one 
portion of the molecules to the left side of the container, disposition 
of the faster molecules on the left, directed motion of a large portion 
of the molecules, in short, any deviation from haphazardness in the 
space and velocity distribution of the molecules—results in a decrease 
in the probability of the state. This conclusion enables us to 
comprehend the kinetic-molecular nature of the irreversibility of 
actual processes. 

As was established above, the second law of thermodynamics 
for irreversible processes, i. e., the law of increasing entropy in 
thermally isolated systems, is a generalisation of experimental expe- 
rience on the impossibility of certain processes. Thus, heat cannot 
be transferred from a cold body to a hot one without compensation, 
a body cannot acquire kinetic energy merely at the expense of a 
decrease in the internal energy of the surrounding medium, and 
a gas may expand of itself but not contract. 

The existence of irreversible processes is a peculiarity of molecu- 
lar phenomena. For a purely mechanical phenomenon, i. e., a proc- 
ess without friction, the process may always be reversed. When 
a pendulum moves to the right, it passes in reverse order through all 
the states passed in moving to the left. A billiard ball rebounding 
from the side of the table in some direction or other will, in turn, 
rebound from an elastic wall placed in its path and retrace in reverse 
order the entire path traversed in the “forward” direction. The com- 
plete equivalence of “forward” and “backward” is evident for purely 
mechanical processes. Why then do molecular processes, which we 
have considered as the totality of the movements of the molecules, 
not possess the property of reversibility. There is only one reason. 
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In all irreversible processes, the probability of the state increases. 
A reversible process is a conceivable process, i. e., its implementation 
is in principle possible, but for the time available to a person for 
observation such a process is, practically speaking, improbable. 

This is not difficult to show for any irreversible process. Heat 
transfers ftom a hot body to a cold one, but not vice versa. In the 
case of gaseous bodies, such a process may be visualised as a mixing 
of fast molecules with slow ones. The reverse process cannot occur 
according to the random law, for it would constitute a sorting out of 
fast and slow molecules, i. e., a transition to a more orderly state. 

For the same reason, using a shovel, we can quite rapidly mix the 
contents of two sacks of different grain. However, we can continue 
to mix the contents of these two sacks endlessly without the grain 
separating in such a manner that one of the grain varieties appears 
above and the other below. It should be realised, moreover, that the 
number of kernels in the sacks is immeasurably less than the number 
of molecules in a cubic millimetre. 

It is also easily seen that the reverse process of spontaneous 
expansion of a gas is completely improbable. If in the partitioned box 
considered above there is gas on the left and vacuum on the right, 
both parts of the box will be uniformly filled with gas within a short 
time. In principle, it could occur that all the molecules turn up 
together again on the left. However, the probability of such an event 


= N 
is extremely small. Its value has been shown to be equal to (+) 


No matter which irreversible process we consider, the result will 
always be the same—each irreversible process is accompanied by an 
increase in the probability of the state. 

Thus, there are two quantities that increase during irreversible 
processes—the entropy S, with which we are already familiar, and 
the probability of the state W, which we have just discussed. It 
is natural that these two physical quantities should be related. Boltz- 
mann showed that such a relation does, in fact, exist. The formula 
given by him has the form S = k ln W, i. e., the entropy is propor- 
tional to the logarithm of the probability of the state. 

The second law of thermodynamics thus acquires still another 
formulation: In reversible processes, the probability of the state 
does not change, while in irreversible processes (we are referring to 
closed systems) the probability of the state increases. 


78. Fluctuations. Limits to the Application of the Second Law 


All physical properties remain unchanged if the space and velocity 
distributions of the molecules do not change. In principle, the distri- 
bution of the molecules of a substance may change with time. How- 
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ever, as we have indicated above, the most probable distributions 
stand out so sharply that deviations from these distributions must 
be considered to be very rare events. The physical characteristics 
corresponding to this most probable distribution are called the 
average characteristics. Practically, the deviation of a measured 
physical characteristic from its average value for systems having 
large numbers of molecules is impossible to observe. This is the sit- 
uation when the physical properties are being considered for vol- 
umes containing large numbers of molecules. However, if the number 
of particles in a system is small, it is also possible to observe rarer 
space and velocity distributions of molecules. The values of the 
physical characteristics corresponding to these rarer distributions 
differ from the average values. These deviations of the physical 
characteristics from their average values, which occur in systems 
having a relatively small number of particles, are called fluctuations. 
All properties of volumes containing small numbers of molecules— 
e. g., temperature and pressure, thermal capacity and thermal con- 
ductivity—are subject to fluctuations about the average val- 
ues. 

This question may be approached somewhat differently. If a tiny 
mirror suspended from a thin string is placed in a gaseous medium, 
then, from the macroscopic standpoint, the pressure of the gas acting 
on the mirror cannot manifest itself, for the forces action all sides 
equally. In principle, from the molecular standpoint, the changes 
in momentum due to the impacts of the molecules on the mirror do 
not necessarily have to balance for the different portions of its sur- 
face. A light mirror may thus begin to execute fluctuational vibra- 
tions. As was stated above, for any particle (molecule, Brownian 
particle, pea), the energy of thermal, random notion is equal to 


sur for one degree of freedom of motion. And this is the energy, on 


the average, falling on the mirror. On the other hand, the work of 
rotating the string by an angle Aq is equal to MAg. Therefore, angu- 
lar deflections of the order of magnitude Ap mo af will occur quite 
often. 

Such fluctuations are indeed observed and their measurement may 
be used for the experimental determination of Boltzmann's constant 
and, hence, of Avogadro’s number. 

Fluctuational effects limit the accuracy of measurements. A point- 
er, mirror, or any other indicating device is subject to fluctuations. 
At room temperature,the accuracy limit in energy units is about 
40 joule. In the construction of many instruments such accuracy 
has not yet been achieved, but in some of the best measuring devices 
jt already has. 
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Fluctuations limit the applicability of the second law of thermo- 
dynamics. They represent processes in which the system passes from 
a more probable state to a less probable one, i. e., processes in which 
the entropy decreases. 

This is excellently illustrated by Brownian movement, where the 
pressure fluctuations occur in small volumes and affect individual 
particles. Due to these random pressure vibrations, a particle may, 
for example, be impelled upwards. However, motion against the’ 
force of gravity requires work. In this case, the work is performed at 
the expense of the random, thermal motion of the molecules, i. e., 
only at the expense of the internal energy of the substance, which 
is at complete variance with the second law of thermodynamics. 

Although phenomena for which the entropy decreases occur at 
times in individual small volumes, i.e., the second law of thermody- 
namics is contradicted, the system as a whole will always obey this. 
law. Due to the randomness of the events, the number of processes 
occurring at the expense of the internal energy will be exactly equal 
to the number of processes occurring in the reverse direction. It can 
be rigorously proved that any attempt to create a perpetual engine 
of the second kind by “selecting” in individual small volumes proc- 
esses that contradict the second law will end in failure. 

The second law of thermodynamics has a limit at the other end of 
the scale as well. In addition to being inapplicable to systems having 
a very small number of particles, it loses validity for systems having 
an infinitely large number of particles, namely, the universe or any 


of its infinitely large components. As was explained above, the es- 


sence of the second law of thermodynamics consists in the fact that the 
greater than the 


number of equilibrium states is overwhelmingly a 
number of nonequilibrium distributions. However, for the universe, 
which consists of an infinitely large number of particles, this state-_ 
ment loses its meaning, for the number of equilibrium states and 
the number of nonequilibrium states are both infinitely large. Con- 
sequently, for the universe as a whole, one cannot speak of the differ- 
ences in the probability of states. i \ x 

This is confirmed by general philosophical and physical considera- 
tions first advanced by Engels and Boltzmann, respectively. As 
a matter of fact, if we assume that the second law of thermodynamics 
is applicable to the entire universe, then it naturally follows that 

e universe is approaching a “thermal death”. It implies that sooner 
or later “the most probable” state of thermal equilibrium will 
be established in the universe, whereupon further processes will 
Cease. This notion, first advanced by Clausius, 1s at variance with 
the materialist view that the universe 15 eternal in time. 
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CHAPTER XIII 
PROCESSES OF TRANSITION TO EQUILIBRIUM 


79. Diffusion 


A body interacting with its medium changes its state so as to come 
into equilibrium with the bodies surrounding it: its internal energy 
tends to a minimum, while its entropy increases and becomes 
a maximum when equilibrium is established. These two tendencies 
are usually conflicting. As a result it is difficult to predict the 
effect when both energy and entropy are capable of changing. Let 
us now consider the phenomena of diffusion, thermal conductivity 
and internal friction occurring in closed systems. In other words, 
we are concerned with equalisation of the concentrations, tempera- 
tures and velocities of some parts of a body with respect to others. 
(Naturally, equalisation of velocities is meaningful only for liquids 
and gases.) Since the energy cannot change in such systems, the tran- 
sition to a state of equilibrium consists only in-an increase in entropy. 

The basic laws for the phenomena of diffusion, thermal conductiy- 
ity and internal friction are very similar. Let us first consider 
diffusion processes. It is immaterial whether we deal with the equal- 
isation of the concentration of a gas or a liquid. Our discussion will 
even be valid for solid solutions (see p. 618), since in this case too 
the tendency to maximum entropy makes the atoms or molecules of 
a substance intermix so that a single concentration is established 
in all parts of the body. 

Let us consider two physically close, infinitely small volumes of 
a substance whose concentrations of diffusing atoms (or molecules) 
are c and c + de. If these two volumes are a distance dx apart, the 


ne Cee, 
ratio -zz gives the rate of change of concentration. This ratio is called 
the concentration gradient. If the z-axis is chosen so that its positive 


direction coincides with the diffusion direction, then L will be 
a negative quantity. Substance will migrate in the direction of lower 
concentrations. e 

This does not mean that all the molecules move in one direction 
in a continuous, uninterrupted stream. On the contrary, diffusion 
maintains to a considerable extent the haphazard features peculiar 
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to molecular motion. The molecules move haphazardly in all direc- 
tions, including the direction of greater concentration, but the 
probability of molecules being displaced in the “correct” direction 
is greater than for other directions. This means that through an area 
perpendicular to the flow more particles pass in the direction from 
greater concentration to less concentration than the other way round. 
The basic law of diffusion states that the flow of matter p, i.e., 
the mass of matter passing through a unit area per unit time, is 
directly proportional to the negative 
gradient of the concentration: 4 


ws pes tana= 22 <o 


D, the constant of proportionality, 
is called the coefficient of diffusion. 
The above relation is convenient 
since the coefficient of diffusion is, 
within broad limits, a constant for 
a given substance and medium. 

In measuring the concentration 
and the flow of matter, the units 
should correspond. Thus, if the con- 
centration is measured in grams per Fie. 88 
cm®, the flow should be measured in E 
grams per cm? per sec. We see then 
that the dimensions of the coefficient of diffusion are completely 
determined and in the CGS system are expressed in cm?/sec. 

A decrease in concentration usually follows a sagging curve as 
shown in Fig. 88. If we are interested in the portion for which the 
decrease in concentration may be represented by a straight line, then 


where c, and cz are the values of the concentrations at points 7% 


and a», respectively. 


The coefficients of diffusion vary within broad limits. 


For example: 3 3 
1) for gases at temperatures from 0° to 15°C: 


.778 cm2/sec; 
178 cm2/sec; 


hydrogen > oxygen, 7 
.099 cm2/sec} 


air > Oxygen, k 
air > carbon disulphide, 
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2) for solutions of blue vitriol diffusing in distilled water (where c is in 
gram equivalents per litre): 


€ D (cm2/day) 
0.4 0.39 
0.5 0.29 
0.95 0.23 


80. Thermal Conductivity and Viscosity 


The temperature equalisation process is very similar to that of 
diffusion. If the temperature of a body differs at different points, the 
-entropy is not a maximum. In order for equilibrium to be established, 
the average velocities of the molecules, and hence the temperatures, 
must become equalised. 

If the temperatures at two neighbouring points separated by a dis- 
tance dz are T and T+ dT, the ratio A expresses the rate of temper- 


ature drop and is called the temperature gradient. 

During the process of temperature equalisation, portions of the 
body having more energy give up energy to portions of the body 
having less energy. In a certain sense, heat “flows” from one portion 
to another. The amount of heat passing from one portion of the 
body to another through a unit area per unit time is called the 
heat flow q. Just as in the case of diffusion, one can assume that the 
heat flow is proportional to the negative temperature gradient. 


The greater the temperature difference, the more rapid the heat flow. 
The formula 


E aT 
R dx 
is also convenient here too since the const 


which is called the coefficient of thermal c 
for a given subst 


heat flow. For a 
simplified form 


ant of proportionality x, 
onductivity, is a constant 
ance and does not depend on the magnitude of the 


linear temperature drop, the formula assumes the 


a T2—T; 
S as ay 


It is not difficult to 


of thermal conductivity. In the CGS system 


F , this coefficient is meas- 
ured in cal/cm sec deg. It is evident 


from the formula that x is the 
1 of 1 cm? when the temperature 
drops 1° over a distance of 4 cm. 


The values of the coefficient of thermal conductivity, just as in the case 
of the coefficient of diffusion, vary within broad limits. 
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For example: 
1) for solid bodies (0°-18°C): cork —0.00012, wood— 0.0008, fused 
quartz—0.0033 and silver—1.06 cal/em sec deg 
bs 2) for liquids: carbon bisulphide (14°C)—2 X 10-4, sulphuric acid 30% 
(32°C)—62.4 X 1074, mercury (0°C)—0.2 cal/cm sec deg 
3) for gases (0°C): carbon dioxide—3.4 X 40-8, air—5.7 x 10-75, hydro- 
gen—40.6 x 107% cal/em sec deg. 
‘The third phenomenon of this class relates to the equalisation of 
velocities. If a gas or liquid moves in some direction or other so 
that different layers of the substance move at different velocities, 
the motion is unstable. Sooner or later, the velocities must become 
equalised—the slower layers are accelerated and the faster ones are 
retarded. This phenomenon is also called internal friction or viscosity. 
Let us consider a liquid or gas moving along an z-axis. Assume 
that different layers of the fluid are moving at different velocities. 
Along the y-axis, perpendicular to the direction of flow of the liquid 
or gas, take two close points separated by the distance dy. The veloc- 


ities of flow differ at these two points by dv. Thus, the ratio z is 


the gradient of the velocity and expresses the rate of change of the 
velocity as we move away from the surface of the fluid. To make 
this clear, consider a rapid stream. The velocity of flow is a maximum 
at the surface and gradually decreases as the bottom of the stream is 
approached. 
If at some instant we eliminate the causes of the fluid motion, the 
velocities of the different layers will begin to be equalised in accord- 
ance with the law of increasing entropy. In order for such an equali- 
sation to be possible, an internal frictional force must exist between 
the layers of the liquid or gas. The magnitude of this force per unit 
layer area is proportional to the gradient of the velocity, i. €., 


dv 
f= = Magi 


Here, y is the coefficient of viscosity (or internal friction). Its dimen- 
sions in the CGS system are gm/cm sec. Such a unit is called a poise. 


The viscosity of different podies varies within even broader limits than 
t alog fficients considered above. For example: 
ne e Cais: a E X 401% mines (420°C)—4 106, lead 
(9°C)—4.7 X 1014, ice (—14°C)—8.5 X 4012 poises. 
2) Liquids: ethyl ether (25°C) —0.0022, water (20°C)—0.01, glycerine 
(0.8% water 18°C) —13.93 poises. k me $ i 
i R a ra a R E 10. poise: 
EA ES Oe en has one-half the viscosity of air and 


It is interesting to note that hydrog' J 0 
seven times its thermal conductivity. That is why hydrogen is used to cool 


powerful turbogenerators. 
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81. Rate of Equalisation 


We all know that the time it takes for equilibrium to be estab- 
lished varies within broad limits. The temperature of a red-hot piece 
of iron thrown into water and the temperature of the water become 
equalised rapidly. On the other hand, the temperatures of air and 
a hot brick become equalised slowly. Nitrogen diffuses almost instan- 
taneously in oxygen, while the equalisation of the concentrations 
of a solution of blue vitriol takes many days. Similarly, the equali- 
sation of velocities also varies within broad limits, depending 
on whether we are dealing with a gas or a viscous liquid. 

A universal formula for the time of equalisation cannot be given, 
for the geometry of the object affects the time. A cooling body may 
have the form of a cylinder or a plate; a diffusing gas may initially 
be within a small spherical volume or distributed over some surface; 
and the internal friction may occur in pipes of various cross-section 
or in open reservoirs. The particular circumstances must be taken 
into account in each case and exact calculation of the value for the 
time of equalisation is a difficult mathematical problem. However, 
we can abstract from the geometric details and try to solve the 
problem in general form if we abandon the aim of obtaining an exact 
formula and are content to merely determine the proportionality 
between the physical quantities. In this connection, it is helpful to 
consider the dimensions of the physical quantities that should be 
related to each other. 

Let us consider, for example, the phenomenon of diffusion. It is 
evident that ż, the time of concentration equalisation, depends, in 
the first place, on the dimensions of the region in which the diffusion 
occurs (characteristic length Z) and the properties of the diffusing 


substance (characterised by the coefficient of diffusion D). The 
de 


diffusion equation has the form u = =) . The dimensional equa- 
tion is then 

M M 

pr =D] gr. 


j BA, z e 2 
We see that T = jpj’ Í e+» the time of equalisation t = k= and 
does not depend on the concentration. 
Therefore, the following conclusion may be drawn: Every rigorous 
solution determining the time of concentration equalisation for 
diffusion processes will always yield the equation 


L2 


De a oe 
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where K is a constant, dimensionless quantity depending on the 
geometric conditions of the problem. The quantity L, on whose 
square the speed of concentration equalisation depends, refers to the 
geometric dimension of the region in which the equalisation occurs. 
Thus, if the concentration becomes equalised within the limits of 
one centimetre in, say, 10 sec, then, within the limits of two cen- 
limetres, it will become equalised in 40 sec. 

We can solve the problem of temperature equalisation in exactly 
the same manner. The basic relation of this phenomenon includes 
the following quantities: heat, coefficient of thermal conductivity, 
temperature and distance. However, the increment of heat per unit 
volume may be written in the form 


dq=pcpdT, 


where cp is the specific heat at constant pressure and p is the density 
(thus, cpp is the thermal capacity of a unit volume). Therefore, the 
following quantities must þe related to each other: temperature, 
length, time, density, thermal capacity and thermal conductivity. 
It is easily verified that the time @ cannot depend on the tempera- 
ture and can only be expressed by the other quantities as follows: 
L?pep 
% 


a 
Thus, the time of temperature equalisation is expressed by the for- 
mula R 
t=K=, 
x 


where y designates the combination of constants -o7 + The quantity 


y is called the thermometric conductivity and has been introduced in 
order to put the formulas for the equalisation of concentration and 
temperature in similar form. The coefficients of diffusion and ther- 
mometric conductivity have the same dimensions and are complete- 
ly analogous in the two equalisation phenomena considered. 
We thus see how the cooling of a body is determined. The greater 
the density and thermal capacity, and the smaller the coefficient of 
thermal conductivity, the slower the process. 
e two rods of identical dimensions made of fused 


quartz and silver, respectiv 0.0033 cal/em sec deg, p = 


= 2.6) 3 = 0.1844 cal/gm deg, i.e., % = 0.676 X 1072 cm?/sec. 
Pt a aa 4 10.3 gm/cm® and cp = 0.0558 cal/gm 


For silver, x = 1.06 eal/em sec deg, p = 10.58 . 
deg, i.e., y = 1.74 cm2/sec. Thus, the equalisation of temperature in the quartz 


rod takes 253 times longer than in the silver rod. 
_ Just as for diffusion, equalisation of the temperatures is character- 
ised by a dependence on the square of distance, i.e., the time of 


Example. Let us compar 
zely. For quartz, % = 
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equalisation is proportional to the square of the linear dimension 

of the region. ; 
Without going through an analogous procedure, let us write the 

formula for the time of velocity equalisation for the various parts of 


a liquid or gas. It is not surprising that the form of this relation is 
similar, namely: 


2 
ee ye R 
Vv 


The coefficient v, which determines the rate of equalisation of the 
velocities, is equal to A and is called the kinematic viscosity. 


Ezample. For water, n= 0.01 poise and p= 1 gm/cm?, i.e., v= 
= 0.01 cm?/sec; for glycerine, ņ = 13.9 poises and p = 1.25 gm/cm’, i.e., 
y = 11.1 cm?/sec. This means that if a disturbance in glycerine is equalised 
in 0.1 sec, the same disturbance in water will be equalised in about 2 minutes. 


82. Steady Processes 


If a body is left undisturbed, the differences in temperature con- 
centration and velocity of the various parts of the body will be equal- 
ised, to be sure, in accordance with the principle of increasing entro- 
py. However, it is also possible to have a state of a body for which, 
over a prolonged period, the flow of heat or matter, or the velocity 
distribution of various parts of the body with respect to each other, 
remains unchanged. Processes of this type are called steady processes. 
Naturally, in the case of a steady process, the body is not in a state 
of equilibrium, 

Under what conditions are such processes possible? Let us consider 
a metallic rod to which at each instant of time, a certain quantity 
of heat is supplied at one end of the rod, while the other end is in 
thermal contact with a cold body. The condition under which the 
temperatures along the rod will not change, i.e., the condition of 
constant temperature gradient along the entire path of heat flow, is 
that the quantity of heat absorbed by the cold body be exactly equal 
to the quantity of heat supplied by the hot body during the same 
period of time. J 

Under analogous conditions, a steady diffusion process is also 
possible. To create such a process, a certain quantity of matter must 
be supplied to one part of a body and the same quantity must be 
removed from another part. In this manner, a constant difference of 
concentration is maintained between two parts of the body. 

A steady viscosity process may be obtained, for example, in the 
region between two coaxial cylinders rotating at different velocities. 
Since close to the solid surface the liquid or gas has the same veloc- 
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ity as the solid wall, a constant velocity gradient is created within 
the fluid. 

Steady processes do not arise immediately. A certain amount of 
time must elapse for such processes to become established. 

Let us assume that one end of a rod that transmits heat is placed 
in snow. At the initial instant, the temperature of the rod is equal 
to zero at all points. If the other end of the rod is then brought into 
thermal contact with boiling water, the temperature begins to rise 
throughout the rod, but, of course, not at the same rate at all points. 
Almost immediately, a high temperature is established at the end 
of the rod that is in contact with the boiling water. The temperature 
of the end of the rod that has been placed in snow will rise slowest. 
After a certain period of time, the temperature will cease rising at 
all points of the rod and a definite temperature distribution becomes 
established, i.e., the process becomes steady. The nature of the tem- 
perature distribution depends on the amount of heat supplied 
(or removed) per unit time. 

In an electric iron heated by a spiral element, the highest temper- 
ature is in the central region and the temperature gradua ly drops 
towards the outer edges. Naturally, the air immediately surrounding 
the iron is hottest. With increasing distance from the iron, the tem- 
perature falls more rapidly owing to the low thermal conductivity 
of air. 

In the case of small bodies in air or liquid, the temperature curve 
need not be considered in rough calculations, i.e., it suffices to deal 
with the temperature difference T — To between the body and the 
medium. The heat flow per unit time from the body to the medium 
may then be assumed to be proportional to this temperature differ- 
ence: 

q=k(T—T): 


The coefficient & is called the coefficient of thermal output and is an 
important engineering quantity. In courses on heat engineering, 
values for this coefficient are determined and related calculations 


discussed. A l 
Let us designate by P the power supplied to a body, e.g., electric 


power in the case of an electric iron. The condition for a steady proc- 


ess requires that 
TS D eI): 


Here, T is the body temperature established in 
TEET E It may vary considerably, depe 
e 


tions of heat exchange. ~ 
this point to comment ọn the temperature 
placed “in the sun”. The thermometer 


this steady process: 
nding on the power 


supplied and the condi 


_ It is appropriate at 
indicated by a thermometer 
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is involved in the steady process of transferring solar heat to the 
surrounding air. Depending on the value of the coefficient of thermal 
output, the thermometer lying in the direct rays of the sun may 
indicate any value whatsoever. The temperature measured under 
such conditions is the temperature of the thermometer and is in no 
way an indicator of the weather. 

We shall not consider the analogous diffusion problems. 


83. Motion in a Viscous Medium 


Consideration of the dimensions of physical quantities helps to 
solve problems of tremendous practical importance. One such prob- 
lem is the steady flow of a liquid or gas around an obstacle or, what 
amounts to the same thing, the steady motion of a body in a medium, 

The most important problem concerns the resistance force ex- 
perienced by a body in moying through a medium. This resistance 
force may depend on the body dimension L, the body velocity u, 
and properties of the liquid (or gas), namely, its density p and vis-- 
Cosity yn. Other quantities should play no part in this process. 

Let us first consider the dimensionless quantity comprised of L, u, 


p and n. It will be recalled that the kinematic viscosity, v =) , has 


the dimension Z?7'-1. But the product Zu also has this dimension. 
Therefore, 


Rec pLu Lu 


n v 

is a dimensionless quantity. This quantity is designated as indicated 
and is called the Reynolds number. It can be shown that Re is, in 
effect, the only dimensionless combination of the indicated quanti- 
ties. Other dimensidnless quantities can only be functions of the 
Reynolds number, i.e., f (Re). If the motions of different bodies in 
different fluids lead to one and the same value for Re, the motions 
are said to be similar. A large technical field founded on the prin- 
ciple of similitude has developed. In this field, the characteristics 
of a phenomenon are determined on the basis of observations of a 
similar phenomenon occurring in a model. 

Let us now return to the problem raised above, namely, finding 
the expression for the resistance force experienced by a body moving 
in a medium. 

The dimensions of force are given by MLT~-*. This can be equated 
to the dimensions of the quantities with which we are dealing, since 
it cannot depend on any other quantities. Thus, 


MLT = [p]* (ul? [L]" [y]°, 
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i.e., 
METES MEI e RINM TT. 
Hence, 


a+ô=1, 8a+B+y—6=1, —ß—ô= —2. 
Expressing «, B and y in terms of ô, we obtain 
a=1—6, pB=2—6, y=2—ô.' 


Thus, in the most general case, F may be expressed in the form of a 
sum of terms, each of which has the indicated dimensions, i.e., 


F= A [ptu Ln? | = pu2L2A [ (=) =] 3 


where A represents numerical coefficients. We have thereby demon- 
strated that the force must be given by the formula 

’ F = Kopu*L*f (Re). 
This result has been obtained only by considering the dimensions! 
The function f (Re) is not known and must be determined experi- 


mentally. 

Definitive formulas for limiting cases may be obtained by simple 
reasoning. If the velocity is small, F must be proportional to the 
first power of the velocity u. For this to be true, f (Re) must be equal 


to = and, therefore, 
Re 


F= KruL. 
The numerical value of the constant depends on the form of the 


body. For a sphere, 
F = bnnur, 


where r is the radius of the sphere. The last formula is known as 


Stokes’ formula. 

E. DA cury globule (r = 0.53 mm), sinking in glycerine with 
a veloolty of IEA, experiences a force of friction of about 8 dynes. 

In the case of very large velocities, the fluid motion with respect 
to a body ceases to be steady. Eddies appear and the motion becomes 
turbulent. The body motion may be steady, but the fluid particles 
move more or less randomly. Owing to the intense nature of the 
disturbance, the transfer of motion from layer to layer ceases to 
depend on the viscosity. This can only occur if f (Re) approaches a 
limit as the velocity increases. Therefore, for large velocities, the 


resistance force becomes proportional to the velocity squared: 
F=KowL*, 
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84. Coefficients of Diffusion, Viscosity and 
Thermal Conductivity for Gases 


The processes of equilibrium establishment in gases are closely 
related to the characteristics discussed in the previous article. Tem- 
perature, concentration and velocity equalisation of some parts of 
a gas with respect to others occur owing to the mixing of the mole- 
cules. The rate of this mixing is determined by the role played by col- 
lisions between particles. For example, in case of a large free path, 
the fast molecules quickly penetrate into the regions where the slow 
molecules are located and distribute themselves throughout the gas. 

It is quite natural that the time of equalisation in all three processes 
should be of the same order of magnitude as the time between 
molecular collisions. This may be verified by theoretical calculations 
for particular cases, but we shall not concern ourselves with this 
problem. 

Taking the equation for the equalisation time to be t = = ı where- 
by a dimensionless constant of proportionality whose order of 
magnitude is usually equal to unity is omitted, we obtain on the 
basis of Sec. 81 perfectly identical expressions for the coefficients 
of diffusion,* kinematic viscosity and thermometric conductivity 
(assuming L ~ l): D ~ v ~ y ~ vl. ; e 

The following table indicates the accuracy obtained: 


air hydrogen 
v= 043 v=0.94 
74 =0.18 y=1.3 
vl=0.27 ASS Es] 


These results should be considered good. Agreement within an 
order of magnitude cannot be viewed as accidental when it is recalled 
how greatly the quantities with which we are dealing vary. 

Using the expression for the coefficient of thermal conductivity 
in terms of the thermometric conductivity, we obtain 

MC pv 
x ~ puley ~ a3 


where m is the mass of a molecule. 


* It should be kept in mind that, in addition to the concept of diffusion 
of one substance in another, there is the concept of self-diffusion, i.e., the 
motion of molecules among similar molecules, e.g., the diffusion of hydrogen 
in hydrogen, oxygen in oxygen, etc. Investigation of this phenomenon became 
possible after the technique of radioactive tracers (atoms and, hence, mole- 
cules) was introduced. 

Thus, D is here the coefficient of self-diffusion. 


oo 
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In this formula, z, the number of molecules in a unit volume, has 
cancelled out. It follows, therefore, that the thermal conductivity of 
a gas does not depend on its density and, hence, pressure. We should 
carefully note this unexpected, but nevertheless perfectly correct, 
conclusion. Increasing the density of gas does not lead to an increase 
in thermal conductivity. 

Another prediction may be made on the basis of the formula for 
the coefficient of thermal conductivity. Since the effective cross- 
section and the thermal capacity hardly depend on the temperature 
(generally speaking, o decreases slightly with increasing tempera- 
ture), and the thermal velocity is proportional to V T, the coefficient 
of thermal conductivity should be proportional to the square root 
of the temperature. 

The data presented below indicate the accuracy of these two pre- 
dictions. For example, for nitrogen at 0°C, 325°C and 500°C: 

%3 1 Xo: x1 = 1.93 31.65: 4 
2 VT: V Te: V M1 = 1.68: 1.48: 1. 

We see that the thermal conductivity increases with temperature 
somewhat more than proportionally to | T. This is due to changes 
in the cross-section and thermal capacity. As can be seen, the thermal 
conductivity is independent of pressure in a very broad interval. 

Similarly, the dynamic viscosity ņ ~ pvl also does not depend on 
the pressure and density of the gas. The temperature dependence of 
the viscosity of an ideal gas should be the same as that of the thermal 
conductivity, i.e., the same proportionality should exist. A numeri- 
cal example will help to fix this point. A 

For nitrogen (T, =273°, T2=289°, Ts= 296°): 

M3: N2 : N, = 1.06 :1.04:4 


Vise VT, : V T= 1.04: 1.03: 1. 
The viscosity of a gas remains amazingly constant with changing 
pressure. When the pressure of CO. changes by a factor of 380—from 
2 mm to 760 mm of mercury—the viscosity practically does not 
change. It remains at all times equal to 14.8 Xx 10-5 poises—to within 
an accuracy of one unit in the third figure. 
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When the free path length in a gas is greater than the linear 
dimensions of the vessel, we say the gasisultra-rarefied. Under normal 
free path length is of the order of 


conditions, the magnitude of the 3 
10 cm and is eRe: proportional to the density. Therefore, at 


a pressure of the order of 10™* mm of mercury, the free path length 
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is measured in tens of centimetres. For vessel dimensions of about 
10 cm, a vacuum or ultra-rarefied gas is obtained at such a pressure. 

It should be noted that even in a vacuum the number of molecules 
per unit volume is large. For the pressure indicated above, 1 cm? 
of gas contains tens of thousands of millions of molecules. 

When molecules cease to collide with one another, and collide 
only with the walls of the vessel, the gas acquires certain special char- 
acteristics. A number of concepts become meaningless under such 
conditions. Thus, it is no longer possible to speak of the internal fric- 
tion of the gas molecules, since there can be no molecular layers 
exchanging velocities in the gas. It isno longer possible to speak of the 
pressure of one part of the gas with respect to another (however, the 
concept of gas pressure against the walls of the vessel retains its mean- 
ing). Also, the concept of heat exchange between different parts of 
the gas and, in general, all concepts related to interaction between 
different parts of the gas become meaningless. An ultra-rarefied gas 
interacts only with bodies within the gas. 

Tt will be useful to illustrate by means of examples the specific 
character of vacuum as a special physical state of a gas. i 

What is the expression for the heat flow from one plate to another 
when these plates have different temperatures 7; and 7T, and are 
located in a vacuum? The essence of heat exchange in this case consists 
in the following: gas molecules strike a wall and rebound fromt it with 
an average velocity corresponding to the temperature of this wall. 
As regards the expression for the heat flow, examining the familiar 
formula 


—T 
q=x 7S? =pevl 


T; —T2 
D 3 

we see that the change consists in the fact that the role of free path 

length is now played by L, the distance between the walls. Therefore, 


the expression for heat flow should assume the following form in 
the case of ultra-rarefied gases: 


q=pcv (T1 — T). 


According to this formula, when ultra-rarefied gases are rarefied 
still further, the heat flow should decrease after the free path length 
becomes comparable to the linear dimensions of the vessel. And this 
is precisely what experiment shows to be the case. 

Also peculiar to an ultra-rarefied gas are the equilibrium condi- 
tions for a gas in two communicating vessels at different tempera- 
tures. In the case of a usual gas, the gas pressures in both vessels are 
the same at different temperatures, but the gas densities are differ- 
ent, i.e., they are inversely proportional to the temperatures. 
Equality of pressures is necessary for equilibrium, for otherwise gas 
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molecules will be knocked from one vessel into the other as a result 
of molecular collisions. 

In the case of a vacuum, the situation is completely different. In 
this case, no collisions occur between molecules and as a result the 
molecular flow between vessels is not impeded. The equilibrium 
condition consists in equality of molecular fluxes. If there are n 
particles in a unit volume and the particles move with a velocity v, 
then nv molecules pass through a unit area per unit time. Thus, for 
equilibrium, nyvy = nW. Since the number of molecules in a unit 
volume is proportional to the pressure divided by the temperature 
(this follows from the equation of state for an ideal gas) and since 
the molecular velocity is proportional to the square root of the tem- 
perature, the condition for equilibrium assumes the form 


Pi- ge 2 

VM Vr 
Thus, it is not the pressures that are equal, but rather the ratios of 
the pressures to the square root of the temperatures. If the gas den- 
sity is increased, the pressures gradually begin to be equalised and 
the usual equilibrium condition is obtained when the free path 
length becomes sufficiently small. 
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PART TWO 
ELECTROMAGNETIC FIELDS 


CHAPTER XIV 
ELECTRIC FIELDS 


86. Vector Properties of Electric Fields: Intensity and Displacement 


The presence of an electric field in a region may be recognised by 
a variety of properties. Thus, an electric field creates a force that acts 
on electric charges. Also, it can induce electric charges on the surface 
of a neutral metallic body. 

By measuring the force acting on a charge q, one can show that 
the force F has different magnitudes and directions at different points 
in space, and at a given point is proportional to q. Hence, it is pos- 
sible to describe an electric field by its intensity E, which is defined 
as follows: 

yee 
q 
The stipulation should be made that q must be small. Then, # may 
be measured at points in space that are sufficiently close to each 
other and the field created by charge q does not noticeably distort 
the measured field. 

A vector field is frequently characterised by so-called vector flow 
lines. The tangent at each point of such a line coincides with the di- 
rection of the vector at this point. This also holds for electric fields, 
which may be characterised by vector flow lines of electric intensity B. 


Numerical examples. 1. The electric field intensity of an incandescent wire 
is tens of volts per centimetre. 7 

2. The electric field intensity of the Earth close to its surface is ~ 
~100 volts/metre— 1 Statvolts 

00 = cm 

3. The electric field intensity of a hydrogen atom’s nucleus at a distance 
Corresponding to the radius of the electron’s “orbit”? is 19.2 108 statvolts/em = 
= 57.6 X 10!0 volts/metre. 

4. The electric field intensity at which air breakdown occurs is30kv/em = 
= 100 statvolts/em. 


(Gaussian units). 


An experiment for the determination of the charge induced by a 
field may be conducted using two small metallic plates fastened to 
an insulated handle as shown in Fig. 89. Such plates are called Mie 
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plates—after the German physicist G. Mie. Placing the pair of closely 
spaced plates in a field, and then carefully rotating them, positive 
charge may be accumulated on one plate and negative charge on the 
other. Moreover, the quantity of induced electricity may be measured 
by an electrometer or ballistic galvanometer. 

Experiments show that the plates may always be so oriented“ that 
no electricity is induced on the plate faces. For homogeneous, iso- 
tropic bodies (and for the present we shall not consider others), this 


be 


/ \ 


Sago ey 


/ i 
! 
! 


Fig. 89 


occurs when the plate faces are parallel to the vector Æ. On the other 
hand, the induced electricity is a maximum when the plate faces 
are perpendicular to the vector #. This enables us to introduce stili 
another vector to describe an electric field, namely, the electric dis- 
Placement D, which is defined by the following condition: The vector 
® is normal to the Mie plates when the orientation of the plates is 
optimal with respect to induced charge, i.e., when a maximum 
charge is induced on them. Moreover, this vector is directed outward 
from the positive Mie plate. In all cases, except for anisotropic bodies, 
® and Ẹ have the same direction. The absolute value of 9 is equal to ø: 


|2|=0, 
where o is the surface charge density of the Mie plate. Since the sur- 
dq 
face charge density o may be written as a," then 


|= 
Ia]= 
It was stated above that the electric field may be characterised 
by the flow lines of vector W. We ciny of course, also desċribe the 
field by the flow lines of vector ®, i the electric lines of force. 
The number of lines of force pa ae oa a unit area perpendicular 
to the force lines is |®|=®, and the quantity 


dN=DdS, 
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is called the electric flux through the area dS. If the same electric 
flux passes through the inclined area dS as through dS, then 
dS 
dS =—+ | 
cosa 
where a is the angle between the normal to the area and the lines of 
force, i.e., : 


dN =Dcosads. 


The flux through a large surface is expessed in the form 
N= $ D cosa dS, 


and the flux through a closed surface is usually denoted by placing 
a circle on the integral sign: 


N= Deosads. 


87. Permittivity 


Experiments show that the two vectors characterising an electric 
field are related. For the case when these vectors are parallel, they 
are also proportional to each other at a given point in space.* 
A change in the vector Æ results ina proportional change in the vector 


®. The ratio depends only on the medium. 


It is customary to characterise the electric properties of a medium 
by the dimensionless quantity e, whose value is selected so that e = 4 
for vacuum. The reason for this condition is the fact that, as will 
presently be seen, no body can exist with e <1. Therefore, it is 
natural to “base” the value of e on vacuum. The quantity e is 
called the permittivity and defined by the equation 


= = &8, 
where £o depends on the choice of units. If the state of the medium 
does not vary from point to point, then e is also constant. At the 
boundary between two media, e changes abruptly. Bodies that are 
nonhomogeneous with respect to density and other properties are 
usually nonhomogeneous with respect to permittivity as well. 
The permittivities of several substances at 18°C are as follows: 
air—1.00059, glass—7.00, paper—2-2.5 and water—80.5. 


* The case of anisotropic media, where the vectors D and E are not paral- 
lel, will be considered on p. 257. 
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In engineering, the quantity D is measured in coulombs/metre ? 
and the field intensity in newtons/coul. Then, gọ is measured in 
coul*/metre? newton and, in these units, 

10-9 coul? 


soar MMO 
yl 36x metre? newton 


In the Gaussian system of units used in physics, ¢9 is dimensionless 


‘ ae 
and equal to aS 


Sy 
AN 


A quantity called the electric induction D may be used instead of 
displacement. It is 4 times larger than displacement and in the Gaus- 
sian system D = e£, 

As we shall soon see, both choices for the value of sọ have their 
relative advantages. The first system simplifies one group of formulas 
but complicates another, while the second system leads to the reverse 
result. 

It should be emphasised that the concepts of electric displacement 
and electric induction have exactly the same qualitative meaning. 
The difference in the numerical factor merely leads to a difference 
between the ratio of an electric induction unit to a charge density 
unit and the ratio of a displacement unit to a charge density unit. 

The electric displacement is equal to unity if the charge density 
on the Mie plates is equal to unity (see p. 227), while the electric 
induction is equal to unity if the charge density on the Mie plates is 


equal to E 
In electrical engineering, as a rule, only the quantity ®, i.e., 
displacement, is used. On the other hand, in physics, the electric 


induction D is used exclusively. 


Several comments are necessary regarding the formulas and units of meas- 
urement that are used in this part of the book. ; 

Although in mechanics and thermodynamics various choices are made for 
the fundamental quantities and various units of measurement are used, never- 
theless, the constants of proportionality are invariably dimensionless. There- 
fore, the form of the formulas in those fields of physics does not depend on the 
choice of the system of units. Ce 3 

Unfortunately, however, the situation is not the same in the case of elec- 
tromagnetic fields. Two general approaches exist, i.e., one approach has been 
adopted in electrical engineering and another in physics. Not only is there 
a difference in the choice of the fundamental quantities and the units of meas- 
urement, but it turns out that we must distinguish between the constants of 
proportionality for one and the same formulas. Of necessity, one must become 

formulas in both systems. This will be done in the course of 


familiar with the ti _be 
the presentation, but for the present it suffices to limit ourselves to several 


general comments. 
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In electrical engineering, the so-called MKSA system is widely used. Side 
by side with the metre, kilogram and second, a unit of current is taken as 
fundamental. A current of 1 amp is defined as the current for which the constant 
of proportionality uo, i.e., the magnetic permeability of vacuum, which occurs 
in ealas for electrodynamic interaction (see p. 273), has the value 


[lo =42 x 10-7 joule/amp? metre. 


Experiments show that for such a choice of unit current 1.118 mg of sil- 
ver is deposited per second on an electrode when a current of 1 amp passes 
through a silver nitrate solution. The historical reasons for this choice, which 
may appear strange, will not be explained here. 

All units in the MKSA system may be expressed in terms of the kilogram, 
metre, second and ampere. 

Since in electrical engineering the system of units is based on four funda- 
mental quantities, there is no way of obtaining the same set of formulas in the 
CGS system, which is based on three fundamental quantities. There are, how- 
ever, other differences between these two systems. These are expressed in the 
different choice of numerical, dimensionless coefficients. In the course of pres- 
entation, we shall on occasion list certain formulas in both systems, while 
in the appendix the reader will find a compilation of the electrodynamic for- 
mulas in both systems with the units of measurement indicated. 
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Consider a system of electrically charged bodies forming an arbi- 
trary field. Now, describe a closed surface in this field. Some of the 
charges will fall within the surface, while others will be external 
to it. The result obtained by measuring the electric flux in the out- 
ward direction through this surface is very simple and in no way sur- 
prising. Thus, the total electric charge induced on the surface— 


which by definition is precisely the flux V = $ ® cos a dS—is equal 
lo the total electric charge within the volume enclosed by this surface: 


G Deosa dS =D} a. 


This theorem, named after Gauss and Ostrogradsky, shows that lines 
of electric flux begin on charges of one polarity and terminate on 
charges of the other polarity. Interrupted lines of force do not exist. 

Lines of electric intensity closed on themselves do not exist in 
a constant electric field.* This follows from the second law for elec- 
tric fields, which states that an electric field (more accurately: the 
vector field of electric intensity E) is a potential field. Thus, the 
work performed in moving a charge along a closed curve is equal to 
zero in such a field, i.e., closed lines of vector Æ do not exist. The 
work performed in moving a charge from one point to another. 


* In vacuum and homogeneous media, the Æ and D vector lines coincide. 
In this case, we can speak of the electric lines of force without indicating which 
of the vectors is being considered. 
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depends only on the location of these points and does not change when 
the form of the path changes. In this respect, the properties of an 
electric field are the same as those of a gravitational field. 

Let us select a reference (initial) point in an electric field and cal- 
culate potential energy with respect to this point. No matter what 
path is taken, the work A in moving a charge from the initial point 
to a given point in the field is always the same. Therefore, at this 
point the charge possesses a potential energy U that is numerically 
equal to the expended work A. 

Just as the potential energy in a gravitational field is proportional 
to the mass of a body, the potential energy in an electric field is 
proportional to the charge: ` 


U= 9q. 
The quantity ọ = z, i.e., the potential energy that a unit positive 


charge would possess at a given point in the field, is called the electric 
potential of the field or, simply, the potential. 

The expression for the work performed in moving a charge from 
one point of the field to another follows from the definition of poten- 
tial. Since work is equal to the change in energy, i.e., dA = —dU, 
then 

dA=Fdl=qE dl=—qdq, 


or Bdl= —dg, 


where dọ is the change in potential. 
For a finite portion of the path 


Edl= p — P2- 


Cron 


Thus, the potential difference” is equal to the workyexpended in 


moving a unit charge. 
If a charge moves along a 
Then, 


line of force, vectors need not be used. 


2 
| Edl=91— 92 
1 


— 


* In a variable field, the above equation is not valid. To avoid confusion, 


2 
\ Edl. We refer to it as the 
Jo 


electromotive force (emf) along the path between points 4 and 2. For constant 
fields, the emf and the potential difference are equal. 


it is convenient to introduce a separate term for 
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Finally, in a uniform field, the formula is simplified to 


Pe ee 
a d z 


where d is the distance between points 4 and 2. 
Formulas relating Æ and ® are written without constants of pro- 
portionality and have the same form in all systems of units. 


Examples. 1. Assume that two flat electrodes of area S = 10 cm? are located 
in air at a distance of 5 mm apart and that the potential difference be- 
tween them is 5,000 volts. The intensity of the created electric field is E = 
= 10° volts/metre= 23 statvolts/em (Gaussian units) and the electric displacement 
in the field of this condenser is D — €o92=9X 10-6 coulomb/metr 2°. This means 
that the charge density on the condenser plates is o= 9 X 1076 cou- 
lomb/metre? = 2,7 statcoulombs/cm? (Gaussian units). The electric flux through 
the face of the electrode is N = DS = 9 x 10-9 coulomb and the charge on 
one plate is q = oS = 9 x 1079 coulomb. Thus, V = q, Which agrees with 
the Gauss-Ostrogradsky theorem. 

2. The electric displacement of the Earth’s field close to its surface is D ~ 
~ 9x 10-10 coulomb/metre?. Since the area of the Earth’s surface is § ~ 
~ 5 X 104 metres? and the surface charge density is o ~ 9 X 10-10 cou- 
lomb/metre?, the Earth’s charge is q ~ 4.5 X 105 coulombs. Thus, the electric 
flux passing through the Earth's surface is N ~ 4.5 X 105 coulombs. 


89. Field Calculations of Simple Systems 


Using the electric field relations presented i the previous article 
and from general considerations of symmetry, van determine the 
field for certain simple systems. To determine the field means to 
calculate the electric intensity, induction or potential. It should be 

edge of the potential suffices to characterise the 
an determine the value 


ecomes particularly clear 
if we construct surfaces of equal potential (equipotential surfaces) 


satisfying the equation P (z, y, z) = const. Since the work of moving 
a charge along an equipotential surface is equal to zero, the lines of 
force are directed normally to the equipotential surfaces. Thus, to 
determine the value of | Æ |, one must differentiate P (x, y, z) in 
the direction of the normal. This type of mathematical operation 
1s considered in vector analysis. However, such differentiation is 
easily performed graphically using a curve in which is plotted as 

along a line of force. The tangent of 
int is equal to the negative 


In order to enable the reader to be 
cepts, we shall use examples to analyse the peculiarities of potential 
and the vector characteristics of a field. We r 


principle knowledge of the potential suffices to 


89. Field Calculations of Simple Systems 233 


Point Charge. From considerations of symmetry, one can see that the 
field of a single, point charge is a radial, spherically symmetrical field. 

Consider a sphere of radius r. The electric flux emanating from 
a charge q is equal to 


| Deos a dS =g. 


The angle a is the angle between the lines of force and the surface 
of the sphere, i.e., it is equal to 90°. At all points on the surface, 
® has the same value and may, therefore, be brought out in front 
of the integral sign. Then 


D 0) dS =q 
and, since Gas = 4ar* (area of the sphere), the electric displacement 
at a point located at a distance r from the charge is D =75 and 
the electric induction is D = +. 


The intensity of the electric field is 
2 dS, 
E= Ameger? * 
ad : Ag - 
In this case, the Gaussian system of units in which e9 = zz is more 


convenient. Then Æ =4 and in the case of vacuum 
r2 


PEs 
=: 


Since the field intensity is equal to the derivative of the negative 
potential along the line of force, i.e., 


the expression obtained for the potential of a point charge is 
q 
=>- 


The constant of integration has been assumed to be equal to zero. 


This fixes the reference potential p = 0 at infinity. i 
Thus, the potential of a point charge is inversely proportional to 
the first power of the distance, while the intensity is inversely pro- 


portional to the distance squared. s f - 
If the charge is in a medium whose dielectric constant is e, the 


1 ; 
intensity and the potential are reduced to — of the values in vacuum. 
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The Earth’s potential is equal to 0.07 volt if the potential at infinity is 
taken equal to zero. In electrical engineering, the potential of the Earth is 
assumed to be equal to zero. 


Systems of Point Charges. Let us consider methods of calculating 
fields created by systems of point charges. Assume € = 4 and let us 
use the Gaussian system of units. Then, the potential for a system 
of charges may be written in the form 


q Ih 
o aa da a D 


where rp is the distance from the charge q, to the point of observation. 


In the case of two charges of equal magnitude, but opposite sign, 
we obtain 


When the signs are the same: 


=q (+=) : 
The form of the above formulas may turn out to be inconvenient 
for solving a given problem. Thus, it is often expedient to introduce 
a Cartesian system of coordinates and express the radius r} in terms 
of x, y, z. In the case of two charges, separated by a distance 2a, it 
is convenient to locate the origin of the coordinate system at the 
midpoint of the system, with the x-axis passing through both charges. 
‘Then, 
ri = («—a)?+y2-+-2? and ry=(e-+a)?+y2+2%. 


Sometimes it is expedient to represent the potential as a function of 
the polar coordinates R and p. From Fig. 90, one can see that 


™1=V R24 @—2aR cos @ and r= R?-+a? + 2aR cos P. 


The field intensity of a system of point charges is given by the vec- 
tor equation 


I eee a y Ada T8 h Sn Tr 
Er E T T, ATEA 
Here, a is a unit vector in the direction of radius 7}. 
Re 

Vector addition is used to map the lines of force. 

Universal Formula for Potential. When a field is created by volume 
and surface charges instead of point charges, the potential of the 
field may be calculated if the charge distribution is known. 
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Consider the region of the volume charge to be divided into 
infinitely small volumes dv and the area of the surface charge to be 
divided into infinitely small elements dS. If p = is the volume 


density of the charge and o = ʻa is the surface density, the potential 


0 dv s 
created by a volume dv is equal to “a and the potential created 


Fig. 90 


by a surface element dS is equal to gD, Adding the potentials creat- 


ed by all the elements, we obtain: 


— C pd ie ods 
Q= aaa N Sr 
The radius r is drawn from the point of observation to all the points 
in space where charges pdv and odS are concentrated. 

This formula is rarely used since the charge distribution as a func- 
tion of p and o is not usually given. In fact, the charge distribution 
is generally being sought. 

Field of a Spherical Con h 
having a charge-+ q and surroun | 
face of ats ae is convenient to view the external sphere as 
grounded, whereupon a charge —4 is induced on its inner surface. 
Considerations of symmetry indicate that the field is radial. If we 
describe a sphere of radius r between the condenser spheres and 
apply the Gauss-Ostrogradsky theorem, the result obtained does 


denser. Consider a sphere of radius ra 
ed by a concentric, spherical sur- 
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not differ from that for a point charge, i.e. 


p=. 


r2 


? 


The potential equation has the form 


but the constant in this case should not be discarded as was done 
previously. As is well known, the potential of grounded metallic 
parts is usually taken equal to zero. It will be, therefore, more con- 
venient to set p = 0 at r=ry rather than at infinity. We then 


obtain: const =— L 


rR 
The expression for the potential in the region between the spheres 
has the form 


awh 
=— a 
On the surface of the inner sphere, 
a eet h 
a TA rB 


Recalling that the ratio of the charge to the potential difference 
between the conductors of a condenser gives the capacitance, we 
obtain for the capacitance of a spherical condenser 
at 1 I Arp 
Ee err 
FA s EB 

If the radius of the outer sphere is increased (r p > co), the expres- 

sion for the capacitance is reduced to 


C=r, 
pacitance of a single sphere is equal to the magnitude of 


Thus, the ca 

its radius. 
If the dielectric between the conductors of the condenser has 

a permittivity e, the intensity Æ and the potenti 


of the above values. 
From the formula 


al ~ decrease to £ 
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and for a sphere, 
C=er. 
Thus, the capacitance is e times the value obtained for vacuum. 


The potential and field formulas being used are applicable for 
points in the region between the conductors of a condenser. They 


A 7 


Fig. 91 


are not applicable to points within the first conductor or external 
to both conductors, for Gauss’s theorem yields different results for 
these points. 

If the charge of the internal sphere is concentrated on its surface, 


then for points within the sphere 
$ D cosa dS =0. 


Since an analogous relation is valid for every surface within the 
sphere, this requires that D = 0 and, hence, the field intensity is 
also equal to zero. Thus, Gauss’s theorem shows that there is no 
field within the sphere if all the charge is distributed on its surface. 
Since Æ = 0, the potential @ remains constant and equal to the 
value of p on the surface of the sphere. The above is illustrated in 
Fig. 91 by the curves of E and ọ as functions of r. 


Examples. 1. The electric field intensity at the Earth’s surface is 


7,400 \2_ 
( 6,400 ) ae 


at a distance of 1,000 km from the surface. 
2. The Barth's capacitance is C = 6.4 X 108 cm (Gaussian units) = 700 uf. 
3. The capacitance of condensers used in radio engineering may be as small 
as a fraction of a picofarad (1 pf = 40-12 fd) and as large as thousands of mi- 


crofarads. 


Field of a Uniformly Charged 
such a sphere the field is the sam 


times the intensity 


Sphere. It is evident that outside 
e as for a point charge or a surface- 
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charged sphere, i.e., 


4 9 
where r is the distance from the centre of the sphere and q = 5 na’p 


(0 is the charge density and a is the radius of the sphere). S 
o determine the field inside the sphere, consider an auxiliary 
sphere of radius r < a. The quantity of electricity inside this sphere 
is less than q, being equal to 
ESR r3 
) 3 pSt 
According to Gauss’s theorem 


3 
T 
DX dar? =- q, 


AS TEDS 
~ Gras" 
Hence, the electric field intensity is 


q 
SSS 
4negea3 ’? 
or in Gaussian units: 


q 
B= M: 


It should be noted that the field is equal to zero only at the centre 
of the sphere. Then, as shown in Fig. 92, the field increases linearly 


a and becomes equal to 4 at the surface 


of the sphere (r = a). Here, the for- 
mula for the field outside the sphere 
and the formula for the field inside 
the sphere yield the same result. From 
this radius outward, the field de- 
creases in accordance with an inverse 
Square relationship. The potential 
outside such a sphere is again given 


by a Inside the sphere, the value of 


Fig. 92 does not interest us and will not 
be considered, 
Cylindrically Radial Field. Let us consider the field cro 


a uniformly charged line or cylinder having 


length. Outside the charged region, the fields 
, the same and have the following form: the lines 


ated by 
a charge 4 per unit 


of such systems are 
of force are at right 
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angles to the axis of symmetry and the flux is the same in all radial 


directions. 

In order to apply Gauss’s theorem about the cylindrical line, let 
us consider an auxiliary cylindrical surface of radius r and unit 
height. Since the flux passes only through the lateral surface of this 


cylinder, ® D cosa dS. is equal to the integral over the lateral surface. 


Owing to symmetry (cos a = 1 and Ð is the same at all points of 
the cylinder), 


i.e., 
a 
L 


Dx dar =4 and D= 


2ar * 


Hence, the field intensity is 


or, in Gaussian units, 


er 


Thus, the field of the cylinder is inversely proportional to the 
distance. This formula is equally valid for the region about a charged 
line, the region outside a charged cylinder and the region between 
the conductors of a cylindrical condenser. 

Since dp = —Zdr, we obtain for the potential: 


WELA 1 
Q=>7 mz enst. 


The potential decreases much slower with increasing distance than 
in the case of spherical systems. Thus, for example, when the distance 
lue, the potential decreases 


r is increased to 10 times its original va 


to fe of its value rather than to 7p- 


2.3 oe > 
For a cylindrical condenser, where the radii of the cylinders are 


a and b, we obtain 


2 2q a 
(o—9a=— > (Ina—In b)=57T IT 3 
The capacitance per unit length of such a condenser is 
C=— 


2n% 
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Example. For a coaxial cable having an outer radius a = 18 mm and an 
inner radius b = 6 mm, and filled with an iMsulator of relative permittivity 
= 4.2, the capacitance per unit length is C = 1.91 cm/cm = 2.12 pf/em. 


It should be noted that the formulas derived above do not take 
account of field distortion at the ends of the cylinder and hence, 
strictly speaking, are valid only for infinitely long cylinders. Prac- 
tically, however, the derived formulas are valid if the region of 
“distorted” field is significantly smaller than the undistorted radi- 
al field. 

Uniform Fields. Uniform fields, i.e., fields in which the lines of 
force are parallel and evenly spaced, are created by charges in planes 
of infinite extent. Naturally, the flux is perpendicular to such planes. 
The magnitude of the field is again determined by means of the 
Gauss-Ostrogradsky theorem. Thus, let us consider an auxiliary 
surface in the form of a cylinder passing through a plane of charge. 
If the lateral surface of the cylinder is perpendicular to the plane, 
the flux through the auxiliary surface is equal to the flux through 


the two end surfaces of the cylinder. The integral fD cos adS is 


then equal to 2DS, where S is the area of the cylinder base. The 
charge within the cylinder is equal to oS. Hence, the formula for 
the displacement becomes 


o 
D oy . = 
p i ae o ; oe ano ' 
The electric field intensity is E — Jeg, and in Gaussian = E=—., 

0 ~ ao 

We see that the intensity does not depend on the distance to the 
sc the us now consider a parallel-plate condenser. 
The field inside spherical and cylindrical condensers is created only 
by the inner surface of charge, while in a parallel-plate condenser the 
field between the plates is created by both surfaces of charge. Just 
as in the case of the condensers considered above, there will be no 


field outside the condenser. Between the condenser plates D = o and 
the intensity in Gaussian units is 


47 
E=~o 
A 
In writing the expression for the potential of a uniform field, let 


us reckon the distance x from one of the charged plates in the 
direction of the lines of force, Thus, in the case of a single plate, ths 


potential is written in the form o= — at ox +- const. In the case 


of a condenser, the expression for the potential between the plates 
becomes 


p= = Ox -+-Const,. 
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Hence, the potential difference is 


4x Ax 
Pa — Po =Z 0 (x) — ta) = Fod, 


where d is the distance between the plates. Thus, the capacitance 
per unit area of a parallel-plate condenser is 
=T or, in practical units, =i, 

The above formulas are exact only for plates of infinite extent. 
In practice, they may be employed if the effect of the condenser edges, 
where the nonuniformity of the feld is pronounced, is not 
great. We can determine the field at some point by means of the 
derived formulas only if this point is sufficiently far from the edges. 
More specifically, this condition means that the field created by 
elementary charges located at the edges of the plates should be 
much less than the field created in the neighbourhood of the point 


under consideration. 


Example. Let us return to the condenser considered on p. 232. The distance 
between the plates will be doubled using two different methods: 

Method 41: The plates remain connected to a source of voltage U = 5 kv. 
sos J 

Then, C = H =4.8 pl, g=C,U=9X 40-9 OR = 
1 1 

= {0% volts/metre, D, = 9 X 407° coul/metre? and N, = 9 X 4079 coul. 

After doubling the distance between the plates, we obtain: (ey) ON ais 

qa = 4.5 X 10-9 coul, Ez = 0.5 X 10° volts/metre, Dy = 4.5 X 107° coul/me- 
tre? and Ny = 4.5 X 10-9 coul. Thus, half the charge entered the source. 

Method 2: Before doubling the distance, let us disconnect the plates from 

the source (condenser charge q = const). C2 = 0.9 pf, q= q = 9 X 1079 coul, 

U, = = 10 kv, Bo = on = Ej, =D, and N= Ny. Thus, the 

yoltage on the plates doubled at the expense of the work of external forces. 


Field on the Surface of a Metal Object. There is no electric field 
inside a metal object. This follows from the fact that all of a con- 
ductor’s charge is located on its surface. According to Gauss’s theo- 
rem, the field is directed outwardly. 

The surface of a metal object is obviously an equipotential sur- 
face, for otherwise the electric charges would redistribute them- 
selves on the surface of the conductor. It follows, therefore, that the 
lines of flux leaving the surface of the metal object must be perpen- 
dicular to the surface. Since all the flux leaves the surface in a sin- 
gle direction, then, according to Gauss’s theorem, D = o lines 
emerge from unit surface area. In other words, the field intensity at 


A i $ i 4mo 

the surface of the conductor is equal, in Gaussian units, to # = =< g 
Electric Images. Let us consider the electric field created when 

a point charge is placed near a plane metallic surface. Due to elec- 


16—1409 


tric induction, an electric charge of opposite sign accumulates on 
the surface of the metal near the point source. The density of the in- 
duced charge is greatest directly opposite the point source and de- 
creases to zero at infinity. Similarly, for the electric field. 

Let us now consider this problem quantitatively. Since the sur- 
face of a conductor is an equipotential, the conducting surface may 
be considered grounded without in any way reducing the generality 
of the problem. Hence, the polential of the metal plate is equal 


SSSSTTISSS 


y 


WD 


SOSETSS SES 


a) b) 
Fig. 93 


to zero and there is no field inside the plate. We are interested in 
the electric field in the right half-space. The electrical properties of 
this half-space are uniquely determined if the magnitude of the 
charge and its distance from the equipotential plane are given. Itis 
important to note that it is entirely immaterial what is located to 
the left of the zero-potential surface. It is proved rigorously ‘in 
courses on mathematical physics that the field in a particular 
region is uniquely determined if the charge in this region and the 
boundary conditions for the potential are given. 

In Fig. 93a, the field is plotted for two charges of equal sign. Now, 
consider the space of this field to be divided into two symmetrical 
parts. The half-space of this figure is then exactly equivalent to the 
half-space of a charge near a metal plate (Fig. 93b) and the fields of 
such half-spaces should be identical. This is the basis for the follow- 
ing procedure, known as the method of images. We “reflect” the 
electric charge in the surface of the metal plate. In the right half- 
Space, the electric field of the charge and its “image” should coincide 
with the unknown field. Thus, the unknown electric field is expressed 


is 
E 


by the formula 


where r, is the distance of the observed point from the charge and rs 
is the distance of this point from the charge’s image. 

The second conclusion to be drawn is that the electric charge is 
attracted to the surface of the metal plate with the same force as to 


its electric image, i.e., the force of attraction is J ı Where a is the 


distance of the charge from the surface. 

Finally, this approach to the problem enables us to determine the 
distribution of the induced electric charge on the surface of the 
metal plate. This requires that we differentiate the expression for 
the potential in the direction normal to the surface. We obtain 
thereby the electric field intensity Æ, which in accordance with the 
formula given in the preceding section of this article must be multi- 


plied by = to obtain the charge. 


The method of electric images has numerous applications and 
enables us to solve electrostatic problems involving systems of 
nonplanar conductors with point charges located in the vicinity 
of the conductors. 


90. Electric Energy 


Energy of a Condenser. It is easily demonstrated that a charged 
electric condenser possesses energy. Moreover, the measurement of 
the magnitude of this energy is not difficult. Thus, for example, we 
can discharge a condenser through a conductor and measure the ther- 
mal energy released thereby. However, we need not resort to experi- 
ment to determine the factors on which the electric energy of a con- 
denser depends. The formula for this energy follows directly from 
familiar theoretical propositions. 

To simplify this discussion, let us consider a condenser in which 
one of the conductors is grounded. The process of discharging the 
condenser (grounding the second conductor), which is charged to 
a potential difference p by a quantity of electricity q, may be viewed 
as the successive outflow to ground of elementary charges dq under 
the action of electric field forces, Therefore, the work performed by 
the field in this simple process is equal to pdg. As the discharging 
process proceeds, the work performed in transferring each successive 
quantity of charge to ground becomes less and less, for the potential 


difference n= 4 is constantly decreasing. The total work performed 


16* 
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by the field during condenser discharge is 


` This quantity is quite understandably referred to as the electric 
energy of the condenser. Using the relationship between potential 
and charge, we obtain for the energy: 


Thus, for constant potential difference, the electric energy is propor- 
tional to the charge squared. A constant difference of potential is 
maintained when the condenser is connected to a constant source. 
Moreover, if the condenser conductors are insulated, the charge is 
constant. Then, the energy is proportional to the potential squared 
and directly proportional to the capacitance of the condenser. 
Energy of a Field. In the case of a parallel-plate condenser of infi- 


nite extent, the energy formula may be written in the form We; = a 
when using MKS units or in the form We: = “Tr when using Gaussi- 


an units. These formulas give the energy per unit condenser area. 

The energy formulas may be written in terms of intensity Æ rather 
than potential difference p. Making the substitution @ = Ed, we 
obtain 


£9eE2 7 gE? 
Wer = = d, or Wa =p t 
r 5 72 g2 
Thus, the enetgy per unit volume is Z, Let us call w= the 


electric energy density. 

Now, let us consider an arbitrary electric field. Assume that the 
equipotential surfaces and lines of force are plotted and that the space 
is divided into small volumes dv, each of which is bounded by two 
adjacent equipotential surfaces and a lateral surface passing through 
lines of force. Each of these volumes is like a small volume in a par- 
allel-plate condenser and, hence, the electric energy associated with 
such an element is aw = dv. If this expression is integrated over 
the entire volume occupied by the electric field, the formula obtained 
yields the electric energy of the system creating the field. 

Thus, the formula for the electric energy has the form 


Wea= f ay WY: 


The significance of the above mathematical transformations goes 
beyond the formal convenience of using one or another formula. 


plier 
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This new expression for the energy enables us to speak not only of 
the energy of the system creating the field, but of the energy of the 
electric field itself, and leads to the concept of a real electric field. 
In the case of constant fields, this conception can neither be con- 
firmed nor refuted. However, in the case of varying fields, we find 
direct evidence for the existence of electromagnetic fields (see p. 3214). 
Hence, the derived formula for the energy of a field (energy of elec- 
tromagnetic matter) is of fundamental significance. 


Example. Let us continue with the example considered on p. 241. Before 
the plates were moved, the energy stored in the electric field of the condenser 
was na =22.5 X 10-* joule and the energy density W1 =4.5 joules/metre®. 
After the plates are moved by the first method (voltage U = const), the ener- 


gy becomes Wz = = = {1.25 X 1078 joule and the energy density Wz = 


= 1.12 joules/metre® (the volume of the field doubled). The energy of the source 
increases at the expense of the work of external forces and a decrease in the 
energy of the field. ‘After the plates are moved by the second method (q = const), 
the energy W = 2W; = 45 X 107° joule and the energy density does not 
change, i€., W = 4.5 joules/metre®. 


Energy of Interaction. When two oppositely charged bodies draw 
together, the work performed by the forces of the electric field is, 
naturally, at the expense of the energy of the electric field: dA = 
= — dWe. Thus, as indicated on p. 54, the work of the electric 
forces is performed at the expense of a decrease in the potential 


energy ue, This energy is appropriately cailed the interaction 


energy of the charges. a 
What is the relation of this formula to the formula for the elec- 


tric field energy considered above? It is evident that the interaction 
energy is a part of the electric field energy of the charges under con- 
sideration. Carefully examining the formula for the field energy, 
we note that the electric energy has definite meaning even when there 
is only one electric charge in the region. The energy of the field cre- 
ated by a single charged body is appropriately called the self-energy 
of the electric charge. We can always resolve the electric field energy 
into the self-energies of the individual electric charges and the 
interaction energies of these charges. ` 

Let us designate by E1, jioa G field intensity created by the 
first, second, etc., charges. The total field is equal at each point in 
space to the vector sum of the intensities: E=E, 4E}. 


The electric energy density is 
orek DE 


SEa Et. 1E EE, + EEs + apt y 


8a Ax 
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Clearly, the individual terms of this expansion correspond to the 
energy components discussed above. Thus, the Z#-squared terms 
yield the self-energies and the terms involving the product of two 
different intensities yield-the interaction energies. The interaction 
energies of the charges may be positive or negative quantities. On 
the other hand, the self-energies of the charges and the total energy 
of the field must be positive. 

As a rule, only the interaction energies of electric charges are in- 
volved in a problem. We can calculate, therefore, the work of elec- 
tric forces by determining the decrease in the energy of the field. 
The easier calculation is the one that should be performed. 


91. Electron Radius and the Limitations of 
Classical Electrodynamics 


Let us calculate the self-energy of a spherical charge, assuming 
the electricity to be distributed on the surface. The electric field 
is then only outside the charge. Therefore, the energy of the field 
must be integrated over the region external to the sphere. If the 
charge is in a vacuum, the field intensity is expressed by the formula 
4 and the energy density at any point in the region has the form 

2 
eet . Consider the entire region to be divided into spherical layers. 
The energy contained in such a shell, whose inner radius is r and 
outer radius r+ dr, is E X vol. of shell. Since the volume of 
the shell is equal to 47r? dr, the energy in this spherical layer is given 
by the simple expression ear. To determine the total energy of 


the field, this expression must be integrated from a (radius of the 
spherical charge) to infinity. Thus, 


22 

= dr q? 

iy E f a 

J 2 r2 2a * 
a 


pei is the form of the energy formula for an electrically charged 
sphere. 

We shall leave it to the reader to show that if the charge is dis- 
tributed throughout the volume of the sphere the energy formula 
obtained is almost the same. The only difference is that in this case 
the formula contains a coefficient close to unity. 

What is the result if the above formula is applied to an elementary 
particle, e.g., an electron? 

According to the principle of relativity (see p. 423), the internal 
energy of a body of mass m is given by the expression mc*, where 
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c is a universal constant equal to the propagation velocity of electro- 
magnetic waves in vacuum. Equating the two energy expressions, 
we obtain the formula for the electron radius: 


q 
“= n 
Substituting numerical values* in this interesting formula, we 
obtain a = 1.4°x 10-1* cm. There is considerable indirect evidence 
in physics that the order of magnitude of the electron radius deter- 
mined in this manner is quite correct. 

Nevertheless, the conception of an electron as a “usual” electric 
particle is clearly false, for we are immediately confronted with the 
problem of the forces holding the component parts of an electron so 
close together. We know that the forces of repulsion between electric 
particles separated by a distance of the order of 41071? cm is tre- 
mendous. ; 

Furthermore, there are other theoretical difficulties. Thus, it 
follows from the theory of relativity that an electron should be 
a mathematical point. At the same time, the electric energy of 
a charge concentrated at a point is infinitely great. 

These difficulties are typical for so-called classical physics, which 
developed in the main in the 19th century. Classical physics excel- 
lently describes the behaviour of macroscopic bodies. In fact, by the 
turn of the century many scientists believed that classical physics 
was already so perfected that there was little left to be discovered 
in physics. After the discovery of elementary particles, it was natur- 
al to try to apply the laws established for large bodies to elementary 
particles. This is when classical physics began to “fail”. We now 
know that concepts derived from observations on macroscopic 
systems cannot be simply transferred to atoms, nuclei and electrons. 

The electron problem cannot be solved within the framework of 
classical conceptions. Considerable success has been achieved in elec- 
tron theory during recent years, but a complete theory does not 
exist. Therefore, the classical theory of electricity (electrodynamics) 
presented in this part of the book has certain limitations. These 
are encountered in studying the interaction of elementary particles. 
When dealing with the behaviour of a single elementary particle in 
fields created by large bodies and, of course, when considering the 
interaction of macroscopic bodies, one obtains, using classical elec- 
trodynamics, results that are in complete agreement with experi- 
mental data. y 


* g= 1.601 X 10719 coul; m= 9.4066 x 10728 gm; and c= 


= 2.99776 x 1010 =. 
sec 
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92. Electric Forces 


In calculating interaction forces between charged bodies, one often 
uses the concept of an electric force field. Instead of saying that 
body A exerts a certain force on body B, we introduce a force field 
and say that body A creates a field and this field acts on body B. 
As we shall see in Chapter XVI, this field is more than an abstraction. 
An electromagnetic field is a physical reality and nature implements 
the interaction transmitted from one point in space to another (“short 
range” action). By introducing the field concept, we can ignore the 
field sources and determine the forces acting on a charged body 
knowing only the field intensities where the charges of the system 
under consideration are located. 

Every charged body is a system of charges. In the case of a system 
of discrete charges, the force acting on the system is F=q, E1 ṣ4 
+QoH>- .... Here E, Eb, ... are the field intensities where the 
charges are located. When the electric charge is uniformly distributed 
throughout a volume, the force acting on the body may be repre- 


sented by the following integral: F = § Ep dv. Tf the electric charge 
is distributed over a surface, the force is represented by a surface 
integral: F = $ Eods. 


However, one precaution must be taken when the force is deter- 
mined directly in this manner, namely, the value of the intensity used 
in the formulas must be the intensity existing in the absence of the 
charge on which the force is acting. In the formulas in which the force 
is expressed as a sum, the action of charge q; on itself does not enter 
into intensity #;, i.e., in calculating Æ; the field created by q; is 
not considered, etc. The same is true for the integral formulas, i.e., 
the field intensity under the integral sign is the intensity created 
by the entire distribution of electric charge, except for the quan- 
tity of electricity located at the point under consideration. 

Let us illustrate this by means of the force acting on a charged met- 
al surface. As we know, the electric field intensity on the surface of 
a metal bounded by a dielectric is equal on the dielectric side to 


áno ; ; RAAH 
—* in Gaussian units, and on the metal side is equal to zero. The 


field intensity is broken on this surface. To determine the force acting 

on an element of surface, we must multiply the quantity of electric- 

ity odS by the intensity which would exist at this location if the 

element of charged surface under consideration were removed. There- 
A 

fore, it would not be correct to multiply odS either by = , the 

value of the field on the dielectric side, or by zero, the value of 


i 


a 
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the field on the metal side. It can be shown rigorously that the 
field existing at this location after removing the element under con- 


3 P e = 4T 9 3 

sideration is equal to the arithmetic mean of 0 and = pices: IS: 
2m0 n ; 

equal to ae Thus, the formula for the force acting on an element 

of a conducting body’s charged surface has the form 


eek 
210 
dS, 
€ 


and for the entire body 
pain) Sas. 


The integration in the above formula must be performed over the 
entire surface, taking into consideration differences in charge den- 
sity and dielectric con- 
stant along the metal sur- 
face. 

In the case of a uniform 
field (ideally, between the 
plates of a condenser of in- 
finite extent), the force Æ 
acting on the plate area S 


may be determined with- me 
considerable accuracy by 


the formula 
202 Fig. 94 
TO S 


pea 
ad 8 


The magnitude of this force may be measured by means of a Thomson bal- 
ance, whose method of operation is illustrated in Fig. 94. When the potential 
difference between condenser plates of 50 cm? area is 600 volts and the distance 
between plates is 5 mm, the force of attraction between the plates, calculated 
in the two systems of units that we have been using, may be determined, as 


follows: 
Gaussian Units MKS Units 
asia EES fee 058 5 x 10-8 
Osa sta 36r 5x107 
q=CU=8 xX 2=16 statcoulombs = 8.9 x 10712£ 


g=0.32 statfarad/cm? q=CU =8.9 x 10-12 x 600 = 


= 5.3 X 10-9 coul 
ono? . 2% 3.14 X 0.322 
2ra sa = x a= 1.06 X 1076 coul/metre? 


x 50=32 dynes 


F 


2egg 
(1.06 x 1076)2 x 5 x 1078 


TE EOR 
2x1 eee | 


=32 10-5 newton 


250 Electric Fields 


Thus, in order to balance the force of electrostatic attraction, one must place 
a 33.3 mg weight in the opposite pan. 


It is still more difficult to determine the force acting on a body 
having a distribution of volume charge. Here, in the expression 
oF dv, the intensity His the intensity of the field created by all the 
charges except pdv. 

If the charged body is in a dielectric medium, calculation of the 
force is complicated by the fact that when we consider the charge to 
be removed it is also necessary to consider a corresponding portion 
of the dielectric removed, which means the polarised state changes 
(see below). 

If we wish to avoid the difficulties connected with “subtracting” 
the effect of the charge on itself, the force must be determined from 
the energy expression. The decrease in energy is equal to the work. 
Then, if we know the magnitude of the displacement, we can deter- 
mine the value of the force. As a rule, this is precisely the method 
used in determining the force. 


Calculation by this method of the force acting on the plate of 
a parallel plate condenser, F = a S, may serve as a vivid illustra- 


tion of the above. Observing the attraction between the plates of 
the condenser (disconnected from a source of voltage), we can imme- 
diately write the expression for the change in energy when the 
plates come together by an amount A: 


SA = ESA 


ts 
LN 
me sn S. p wie 
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Hence, the force sought is 


Let us return to electrical systems that may be represented as sys- 
tems of point charges. Assume that the electric field is uniform in 
the region of the system of charges under consideration. Then, the 
formula for the force acting on the system has the form 


F=(41+@-+...) E=QE, 


where Q is the total charge of the system. If a body is electrically neu- 
tral, as in the case of an atom or a molecule, the force acting on the 
body, which contains equal quantities of positive and negative parti- 
cles, is equal to zero. Does this mean that an electrically neutral body 
does not interact with the electric field? It is easily seen that the 
answer is no. In a uniform field, the forces acting on the charges of the 
system are parallel to each other. We can determine the resultant of 
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the forces acting on the positive charges and the resultant of the 
forces acting on the negative charges. As is well known, the resultant 
of parallel forces is exerted at the centre of “gravity” of a body. 
Since we are dealing here with the electric centre of gravity, the 
word “gravity” has been placed in quo- 

tation marks. Thus, all the forces acting ———__________, 
on the charges of a system located in a 
uniform field may be reduced to two 
antiparallel forces. One of the forces is 
applied at the centre of gravity of the 
positive charges and the other at the 
centre of gravity of the negative charges 
(Fig. 95). If the system is electri- 
cally neutral, the two forces are equal 
and the total force is zero. However, a 
couple of forces of moment M =qEl sin a 
will act onthe system of charges if the 
centres of “gravity” of the positive and 
negative charges are displaced with re- 
spect to each other. 

The vector p = ql; equal in magnitude to the product of the 
positive charge of the system and the distance between the centres 
of gravity, is called the dipole moment of the system. It is considered 
to be directed from the negative centre to the positive centre. The 
dipole moment of a system determines its behaviour in a uniform 
field. Thus, a system left to 
itself in a uniform electric 
field tends to turn until the 
dipole moment is parallel to 
the direction of the electric 
field (sin œ = 0). 

In a uniform field, the entire . 
effect on a neutral system of 

Fig. 96 electric charges is reduced to 

: the moment of force M = pE 

sin œ, where p is the dipole moment of the system and is equal to 

the product of the quantity of electricity of one sign and the dis- 

tance between the dipole charges. Thus, in a uniform field, there is 

no need to consider the complex distribution of a particular 

system of charges. It suffices to replace it by the corresponding 
dipole. 

it the system is located in a nonuniform field, the dipole moment 
can no longer completely describe its behaviour. This is illustrated 
in Fig. 96 where four charges, located at the corners of a square, 
comprise an electrically neutral system having a dipole moment 


Fig. 95 
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equal to zero. This is because the centres of gravity of the negative 
and positive charges coincide. In a uniform field, neither a force 
nor a moment of force acts on such a system. In a nonuniform 
feld, however, it may undergo translatory as well as rotational 
motion, since generally speaking the forces acting on the charges 
differ. By analogy with the dipole, such a system of four charges is 
called a quadrupole. Anoth- 
er neutral system having 
zero dipole moment, called 
an octupole, is also shown 
in Fig. 96. 

Of great importance in 
r the study of the structure 
of matter, to which we 
shall devote a great deal 
of attention later, is the 
consideration of the inter- 
action of simple electric systems. Let us consider several such 
systems. 

Charge-Charge. The interaction between two point charges is, im 


accordance with Coulomb’s law, F = a : 


Charge-Dipole. A dipole left to itself tends to turn until it is par- 
allel to the lines of force. After it has turned, the dipole remains 
stationary in a uniform field, while in a nonuniform field it will be 
drawn toward the region of greater field intensity (Fig. 97). If the 
nonuniform field is the field of a point charge, the dipole will be 
drawn toward this charge. This force of attraction is 


al (ee eee ; 

r= (7 CEDE ): : 
: ; 3 ae 

Assuming the distance between the dipole char 


n } be small, and 
reducing the expression to a common denominator, we obtain the fol- 
lowing interesting formula if we neglect the quantity l relative 


to rl and the quantity rl relative to 7?: 


/ 


Fig. 97 


n 


Note that the force of interaction between the charge and the dipole 
decreases more rapidly with increasing distance than the Coulomb 
force, i.e., it is inversely proportional to the distance cubed. 


Example. The distance between the H atom and the Cl atom in an HCI 


molecule is equal to 1.28 A, and the dipole moment of the molecule is p = 
= 6 X 10-18 statcoul cm. Therefore, an electron located at a distance "= 


= 10 A from the molecule is attracted to it with a force of ~ 6 x 107° dyne- 
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Dipole-Dipole. Here, it is convenient to solve the problem for 
the two cases in which the dipoles are arranged as shown in Fig. 98. 
The exact interaction formulas have the form: 


9q2 
P= = -- — for arrangement (a) 
2p2 3r 12 
nan a a ee x 
F=- xX TEE for arrangement (b). 


If the distance between the charges of a dipole is small the above 


formulas may be replaced by the i 
following approximate expressions: 
M Sps a) 
F = for arrangement (a) r > 
6p? 
rt 


F= for arrangement (b). 


H—: 
Thus, the interaction forces are ) 
inversely proportional to the P 
fourth power of the distance. 


Example. Two HCl molecules that 


are 10 A apart are attracted with a force 
TF ~ 10-8 dyne in the case of arrangement (a) and with a force F ~ 
x 10-6 dyne in the case of arrangement (b). 


Charge-Quadrupole. Assume 
a that the orientation is as 
shown in Fig. 99. The interac- 
tion force may then be 
written in the form 


Fig. 98 


r 


> $ T: 
F=2q0( pa aa E 


and the approximate formula 
for a small quadrupole is 


2 
Gi tE . Thus, the force is 
Fig. 99 inversely proportional to the 


fourth power of the distance. 


94. Polarisation of an Isotropic Dielectric 


As we know, when a region containing an electric field created by 
a system of charges is filled with a homogeneous dielectric, the 
field intensity and the magnitude of the electric potential are de- 
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creased to of their original values. On the other hand, the electric 


displacement and induction remain unchanged, and the capacitance 
of a condenser increases to e times its original value. The latter 
effect is often utilised in the measurement of dielectric constants. 
Thus, the ratio of the capacitance of a condenser containing a di- 
electric between its plates to the capacitance of the same condenser 
without dielectric may be used as a definition of dielectric constant. 

Let us now go a step further and inquire into the reasons why the 
dielectric affects the electric field. The following experiment sug- 
gests the explanation for this phenomenon. 

Consider a parallel-plate condenser connected toa source of volt- 
age. The electric charge density on the condenser plates and, hence, 
the number of D-lines per unit area are uniquely determined by the 


electric field intensity, i.e., o=. Let us now fill this condenser 


with a homogeneous dielectric. The relation between the electric 
field intensity and the charge density on the condenser plates is then 


expressed by the equation oa , i.e., the flux density (or D-lines) 


increases. In this experiment, the electric field intensity cannot change, 
for it is equal to the potential difference divided by the distance 
between the plates. Therefore, the charge density on the condenser 
plates changes, i.e., it increases to e times its original value. This 
increase may be observed experimentally. Thus, as we fill the con- 
denser with the dielectric, the voltage source adds charge to the 
condenser. By measuring the electric current and the time of flow, 
one can show that the quantity of electricity added per unit con- 


denser area is 
a ya Cage 


As we remove this dielectric, the additional charge returns to the 
source and the additional force lines disappear. To explain the 
additional attraction of charge to the condenser plates, one must 


P c . 5 e—1 
assume that charges of opposite sign, having a density o= ta Es 


are formed on the dielectric surface next to the condenser plates. 

The surface charge of the dielectric may be explained if we assume 
that the dielectric consists of bound pairs of positive and negative 
charges that cannot move through the body, but can move relative 
to each other, forming thereby a dipole moment in each unit volume 
of the dielectric. This transformation of an electrically neutral 
system of charges into a system having a dipole moment is called 
polarisation, and the dipole moment vector of a unit volume of 
dielectric is called the polarisation vector P. 
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Polarisation of a dielectric does not produce volume charge. The 
numbers of positive and negative charges per unit volume remain 
equal to each other after displacement. However, polarisation does 
produce a charge on the dielectric surface. This is illustrated sche- 
matically in Fig. 100. The density of this charge is equal to the 
value determined above, i.e.: 

e—1 p 
Oop Sara E. 

In our discussion, we have considered a dielectric adjacent to the 
plate of a parallel-plate condenser. However, the situation is the 
same for a conductor surface 


of any shape. Moreover, it 

turns out that the above Ahhali tnta kitiith 
expression for opoz is of gen- 

eral validity for surfaces (] i] (| Į (| Į (] (] Îi (] l] l f 


perpendicular to lines of 
AIT TA TTR. 


force. Thus, in every case, 


e—1 


o=— Ev 


where Ep is the projection 
of the intensity on the nor- Fig. 100 
mal to the surface. This for- 
mula is applicable to any real or imaginary boundary in a dielectric. 
A polarised charge (also often called a bound charge) may be 
expressed in terms of the dipole moment of a unit volume. When 
a field is applied in the case of isotropic bodies, the displacement of 
the bound charges occurs along the electric lines of force. Therefore, 
the polarisation vector is parallel to the intensity vector. From 
a dielectric plate, let us cut out a cylindrical rod having a base S 
and a length J. Owing to polarisation, equal and opposite bound 
charges will accumulate at the ends of the cylinder. The dipole mo- 
ment of the rod is equal, by definition, to the product of the charge 
oS and the distance l separating the charges of the dipole, i.e., 
P = Opor Sl. The dipole moment of a unit volume is | P| = opoz- 
It has been assumed in this calculation that the base of the cylin- 
der is perpendicular to the polarisation direction. If the base is 
inclined at an angle a with respect to this position, the charge densi- 
ty on the ends of the rod will decrease as the cosine of the inclination 
angle. Thus, in the general case, the following relationship holds: 


Opou =Pn, where P, =P cosa. 


We are now able to establish a relationship between the polari- 
sation vector and the electric field intensity vector. Combining the 
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last formula with the expression for the density of bound charges 
discussed at the beginning of this article, we obtain for any direc- 
tion n: 


Thus, if the dielectric constant does not depend on the intensity, 
a linear dependency exists between the vectors P and E: 


P=cE. 
The expression œ = = is usually called the electric susceptibil- 
ity. For water a = 6.38 and for glass a = 0.48. } 
Since D = eB, the relationship between the vectors D, Æ and P 
may be expressed in the form 


D=E44aP. 


When the medium is homogeneous, the vectors D, E and P are 
parallel. 


95. Polarisation of Crystal Substances 


Hitherto we have considered the behaviour of a substance that is 
characteristic of an amorphous or finely crystalline body, or of 
a monocrystal in certain special orientations relative to the field. 
However, if a plate is cut from a monocrystal at an arbitrary angle 
to the crystal faces and if the plate is then placed between the con- 
ductors of a condenser, the following effect may be observed: The 
plate becomes polarised perpendicular as well as parallel to the 
lines of force, so that P is not parallel to W. Therefore, in this 
case, D is also not parallel to the field intensity. 

In monocrystals, the inclination direction of a free electric charge 
(E) does not coincide with the direction of the normal to the surface 
oriented in such a manner that a maximum charge (D) is induced on 
it. The relationship between D and Æ becomes more complex and 
in order to find D in terms of Æ, or vice versa, it is insufficient to 
merely know the dielectric constant. In any monocrystal, three 
directions (main axes) in which D || E may be found. If we know 
é for these three directions, the relation between D and E for any 
arbitrary orientation of the crystal in a field may then be established. 
How are the vectors D, # and P related in this case? It turns out 
that the equation P = E + 4x P, introduced in the previous 
article for the case of parallel vectors, is also valid when the vectors 
cease to be parallel. There is another difference between crystals 
and amorphous bodies in regard to their dielectric properties, 
namely, a relatively small class of bodies belonging to crystalline 
substances possess hysteretic properties. Since these properties 
were first discovered in Seignette (Rochelle) salt, such substances 
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are called seignettoelectric materials. Their characteristics will 
now be discussed. 

Place a seignettoelectric material (for simplicity assume that 
we are dealing with a powder or crystal oriented in the field so that 
D || E) between the conductors of a condenser. Let us vary the 
voltage between the condenser conductors, and hence the field 


intensity Æ =7, and measure the charge density o on the con- 


denser conductors, which for D || Æ yields the magnitude of the 
electric induction D. The magnitude of the induction D increases 
as E increases, but it is not directly pro- 

portional to Z, for D begins to increase D 

less until, finally, saturation sets in. 

Clearly, saturation of D corresponds to 

saturation of polarisation. Let us now 

begin to decrease the emf between the 

conductors. Displacement and polari- 

sation begin to decrease and the curve 

follows a downward path, but not the = 
same one taken during the period of rise. 

As a result, when the emf is completely 

removed (Æ =0), the induction and 

polarisation in the dielectric are not 

equal to zero. The dielectric becomes 

similar to a permanent magnet. It will 

have “north” and “south” electric poles 

and will behave like a large perma- Fig. 101 

nent dipole. 

The subsequent behaviour of seignettoelectric materials is evident 
from the hysteresis loop shown in Fig. 101. To “de-electrify” the 
dielectric, the polarity of the emf on the condenser conductors must 
first be reversed. Then, by increasing Æ in this new direction, 
we can depolarise the dielectric. A further increase in Æ again 
electrifies it, but with opposite polarity Finally, saturation sets in 
again and the process may be repeated in the reverse direction. 

Why is this elfect called hysteresis? The word is derived from the 
Greek and means “to lag”. The loop illustrated in the figure shows 
that the values of D, as well as of P and £, depend on the past state 
of the sample, i.e., on its history., ‘ 

Every crystal that does not possess a centre of symmetry in a 
number of its elements of symmetry (see p. 590) possesses an inter- 
esting property, namely, its dimensions change upon application of 
an electric field. This phenomenon is known as electrostriction. 

Thermodynamic considerations show that if an electric field pro- 
duces a deformation, the deformation in turn will produce polarisa- 


17-1409 
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tion. This is known as the piezoelectric effect. Applications of the 
piezoelectric effect were briefly discussed in Part I. The relation of 
this effect to the structure of matter will be discussed on p. 657. 


96. Finite Dielectric Bodies in an Electric Field 


The following questions may arise regarding a finite nonconducting 
body located in an electric field. What forces and moments of force 
act on such a body? How is the field distorted by the presence of the 
dielectric? 

A dielectric body placed in a field becomes polarised and acquires 
a certain dipole moment. Therefore, the behaviour of such a body 
in an electric field does not, generally speaking, differ from the 
behaviour of a dipole. If the polarisation vector is directed at an 


angle to the field intensity, the 


= orientation of the dielectric will 
be unstable. A moment of force 
will act on the body, which will 
tend to turn the body until the 
vectors P and F are parallel. 
Thus, a dielectric body 
placed ina given uniform electric 
field assumes a definite equi- 
librium orientation that de- 
pends on the form of the body 
Let us illustrate this for the case 
of a dielectric bar. 
; Experiments show that the 
Fig. 102 equilibrium orientation is the 
one for which the longitudinal 
axis is parallel to the lines of force. Why is this so? Is it not true 


that the bar does not have fixed poles? Fig.102 illustrates the reason 
for this peculiar behaviour. The forces acting on the bound charges 
of the depicted rectangular bar-may be reduced to four forces acting 
on four surfaces of the bar. We see that the forces acting on the 
longitudinal surfaces almost balance each other, while the forces 
acting on the lateral surfaces form a couple of forces that orients the 
bar parallel to the lines of force. 

If the body is in a nonuniform field, then, in addition to the mo- 
ment of force, there will exist forces tending to pull the dielectric 
toward the region of greater field intensity. This phenomenon may be 
vividly demonstrated by making a dielectric fluid rise in a tube upon 
applying voltage to a condenser. The forces making bits of paper 
cling to a glass or ebonite rod rubbed with fur or leather are of the 
same kind as those acting on a dipole in a nonuniform field. 


Lay a] 
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Let us now turn to the question relating to the distortion of an 
electric field due to the presence of a dielectric body. First, we shall 
show that the general laws for an electric field lead to an important 
relation between the values of the electric field on either side of the 
boundary between dielectrics. 

The electric field intensity vectors at two neighbouring points 
located on opposite sides of the boundary between dielectrics having 


permittivities s; and £, differ EJ £z 
in magnitude as well as in Fisk \ \ 
direction. Let us resolve IES z 

Dey EEtr 


these vectors into compo- 
nents parallel and normal 
to the boundary. It can be 
asserted that the field par- 
allel to the boundary is the 
same on both sides. If this 
were not the case, i.e., if the 
field on one side were great- 
er than the field on the 
other side, it would be pos- 
sible to create a perpetual 
engine by moving charges 
along the boundary against 
the field where the field is 2 ae 
less and then allowing the Dye Dm N—— > Hig ahi 
charge to move on the other Fig. 103 
side of the boundary (where 
the field is greater) under the action of the electric field forces. 
Therefore, the tangential components of the intensity on both sides 
of the boundary must be equal: 

By = En. 

We shall use the Gauss-Ostrogradsky law to determine the normal 
components of the intensity at the boundary between two media. 
Construct an auxiliary surface in the form of an infinitely thin 
disk so that the parallel surfaces of the disk lie on opposite sides 
of the boundary. Since there is no charge inside such a disk, the 
net outward flux through the disk is equal to zero and, therefore, 
the flux through each end is the same. This requires that the normal 
Components of the induction vectors be equal to each other, i.e., 
Dr, = Dn, Hence, in terms of field intensities, we obtain 


EEn, = &2ĒEn, 
Thus, the normal components of the intensity vectors are inversely 


Proportional to their permittivities. 
17* 
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Fig. 103 shows that in passing from a medium of lower permittivi- 
ty to one of higher permittivity the flux lines are deflected away from 
the normal to the boundary. This means that the number of flux 
lines passing through a unit area increases. 

We are unable to determine the distortion produced in an electric 
field when a dielectric having a particular shape is introduced into 


Fig. 104 


this field. This problem is difficult even when the field is uniform to 
begin with. If a body of arbitrary shape is placed in such a field, 
the field becomes nonuniform not only near the body, but inside 
the body as well. 

Interesting exceptions are ellipsoids, a broad class of bodies includ- 
ing spheres, flattened ellipsoids that practically do not differ from 
plates, and extended ellipsoids that are akin to cylindrical bodies. 
In mathematical physics it is shown that the field inside an ellipsoid 
is uniform as indicated in Fig. 104. Applying the law of flux refrac- 
tion, we obtain the typical fields illustrated for a denser body in 
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a less dense medium (£; < £) as well as for the reverse case (£4 > €2). 
Examples are a glass ellipsoid in air and an air bubble in glass, 
respectively. 

It can be shown that if a symmetrical dielectric body is immersed 
in a uniform field Æ, in vacuum, then Æ; is related to the field inside 
the dielectric as follows: 


E; =E — NP, 


where P is the polarisation vector and NV is a coefficient depending 
only on the shape of the body. In the case of magnetic phenomena, 
it is customary to call the latter the demagnetisation coefficient 
(see p. 292). 

Since in most cases P = = 
sion after simple conversion: 


E= 


E;, we obtain the following expres- 


(techies 
NS 
1+e—1) zr 


The dielectric constant is always greater than unity. Hence, the 
field intensity inside the dielectric is always less than the field inten- 
sity present at this location before the dielectric was introduced 
into the field. 

The coefficient N for a flat plate perpendicular to the field is equal 
to 4n. This is the maximum value for N and the resulting decrease 


in field to sof its original value agrees with the result obtained ear- 


lier for a homogeneous medium. Let us take another extreme case — 
a cylinder whose axis is oriented parallel to the field. Here, V = 0, 
i.e., the field does not decrease in such a body. In all other cases, 
the decrease in the field intensity depends on the dielectric constant. 


47 à 
47 and, therefore, E; = e . For a cylinder 


3 
whose axis is oriented perpendicular to the field, yee 
The field intensity Æ decreases because the bound charges create 


a field of opposite direction. 

As regards the induction field, the bound charges affect it only 
indirectly. Thus, the number of D-lines remains unchanged when 
a dielectric is immersed in the field. However, due to flux refraction, 


the induction inside the dielectric increases. . 


For a sphere, V = 


CHAPTER XV 


MAGNETIC FIELDS 


97. Magnetic Moment 


Magnetic fields act on currents, moving charged bodies or parti- 
cles and magnetised bodies. A variety of instruments exist for deter- 
mining the properties of a magnetic field. The most convenient way 
of characterising the properties of such a field is to describe its 
mechanical action on a current circuit. It is quite feasible to con- 
struct a wire circuit of very small area, which enables us to measure 
the magnetic field quite accurately. Thus, a “test” current circuit 
plays the same role in magnetic field theory as a “test” charge does 
in electric field theory. 

Experiments with such a device lead us to the following basic 
conclusions. At each point of the field, a circuit that is free to rotate 
assumes a definite equilib- 
rium position. The position 
of stable equilibrium is de- 


scribed not only by the orien- 
tation of th cuit axis in 
space, but by the orientation 


of a definite side of the circuit, 
e.g., the side for which the cur- 
rent will appear to be flowing 
counterclockwise as viewed by 
an observer on that side of the 
circuit. Let us call this side 
positive or north and agree to 
draw the normal to the cir- 
Fig. 105 cuit so as to form a right-hand 
screw system with the current 
direction. Thus, the normal emerges on the positive (or north) side 
of the circuit. 

If the behaviour of current circuits is compared with that of mag- 
netic needles, one observes that the normal to a circuit in stable 
equilibrium points in the same direction as a magnetic needle. 
Thus, the basic definition is not contradicted if we call the direc- 
tion of the normal to a free test circuit the direction of the magnet- 
ic field. 

A torque will act on a test circuit that deviates from the equilib- 
rium position (Fig. 105). Moreover, the deviation of the circuit from 


97. Magnetic Moment Í 263 


equilibrium is uniquely described by the deviation of the normal to 
the circuit from the direction of the field. It turns out that the sine 
of angle œ and the torque N are proportional to each other, i.e., 
N ~ sina. Furthermore, for a particular angle œ, the torque is pro- 
portional to the product of the circuit area S$ and the current J flow- 
ing in the circuit. Decreasing the area by a certain factor results 
in the same change in torque as a decrease in current by the same 
factor. 

It follows from the above that the magnetic behaviour of a circuit 
depends on the orientation of the normal to the circuit and on the 
magnitude of the product 7S. This can be expressed by means of 
a single vector quantity—the so-called magnetic moment of the ring 
current. In electrical engineering, the magnetic moment is generally 
designated by the vector M = JSn, where n is the unit normal vec- 
tor. In the Gaussian system of units used in physics, a constant of 


proportionality = enters into this formula, i.e., m= +iISn, where 


c is the velocity of propagation of electromagnetic waves in vacuum. 
The introduction of a numerical coefficient, which is moreover dimen- 
sional, may appear to be an unnecessary complication. However, 
other formulas are thereby simplified. This will become clear to 
the reader later on. 

Experiments with a test circuit show that N = BM sina, where B 
is a constant of proportionality. The volume of B varies from field 
to field and for different points in space of a particular field. This 
formula shows that B is equal to the maximum torque acting on 
a unit test circuit (M = 1). We call this coefficient B, which charac- 
terises the magnetic field, the magnetic induction. The vector quantity 
whose direction is that of the magnetic field and which is numeri- 
cally equal to B is known as the magnetic induction vector. 

If the torque is described by a vector directed along the axis of 
rotation (in accordance with the right-hand screw rule), the formula 
for the torque may be written in vector form as follows N = [MB]. 

When V = 0, M is parallel to B. This means that a current circuit 
tends to become so oriented in a magnetic field that the directions 
of the magnetic moment and the field coincide. The magnetic moment 
acting on a body is a maximum when it is perpendicular to the 
direction of the field. For a circuit, this means that the plane of 
the loop of wire is parallel to the flux lines. 

Having determined the magnetic field by means of a current cir- 
cuit whose magnetic moment has been calculated from measure- 
ments of the current and the area, we can then do the reverse, namely, 
use the formula M = [MB] to determine the magnetic moment of 
systems whose currents cannot be measured. Moreover, we can 
transfer the concept of magnetic moment to systems in which the 
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concept of a ring of electric current has no meaning. This is precisely 
the case when the physicist refers to the magnetic moment of an 
electron or nuclear particle. The magnetic moment of a magnetic 
needle is also a concept that cannot be broken down. However, 
after having discussed certain special effects of the medium, we shall 
return on p. 270 to the magnetic moment of a permanent magnet. In 
any case, the magnetic moment of a system located in vacuum can 
always be determined by the above formula for the torque. 

A body possessing a magnetic moment requires the expenditure 
of work to turn it from its equilibrium position. In the case of a body 


turned through a small angle a, the work of rotation may be ex- 
’ pressed in the form 


Nda = BM sina da = —d (BM cos a). 


The deviation of a body from the equilibrium position is associat- 
ed with the accumulation of a “potential” energy U = — BM cosa. 
This product is the scalar product of two vectors. Hence, U = — BM. 

In the equilibrium position, the potential energy is a minimum 
and equal to —BM. When the magnetic moment is turned through 
an angle of 90° the potential energy increases to zero. Finally, when 
the magnetic moment is directed oppositely to the field (position of 


unstable equilibrium), the potential energy is a maximum and 
equal to + BM. 


Examples. 1. The magnetic moment of the nucleus of a hydrogen atom (nu- 
clear magneton) is 0.505 x 10-2 Gaussian unit. The magnetic moment of an 
electron (Bohr magneton) is 0.927 x 10-20 Gaussian unit. 

2. An electric current of fa flowing in a loop whose area is 50 cm? creates 
a magnetic moment of 5 X 10-8 a metre? = 5 Gaussian units. 

3. In the Gaussian system of units magnetic induction is measured in gausses 
while in the practical system B is measured in volt sec/metre?; 4 volt 
sec/metre? = 10! gausses. In the case of the magnetic field of the Earth, B = 
= 0.49 gauss. 


4. The magnetic induction in the air gap of a powerful electric generator 
attains a value of several thousand gausses. Academician P. L, Kapitsa has 
obtained pulsed magnetic fields in which B ~ 105 gausses. 


98. Ampére Force 


The torque acting on a current-carrying circuit is clearly the 
resultant of the forces exerted on every part of the conductor in 
which current flows. We can experimentally establish the relation 
for the force acting on a current clement. For this purpose, it is 
necessary to isolate a part of the circuit—e.g., by means of mercury 
contacts. This part is then able to move under the action of a force. 
Utilising the tension of a spring to counterbalance the displacement, 
one can measure the magnetic force (Fig. 106). 


—— 


bias- 
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Ampere first established the relation for the force acting on a cur- 
rent element of small length. This relation has the following form: 


I f I AN 
dr=ņ dl, B] i:e.; dF =— dlx Bsin dl, B. 


The vector notation here is suggestive of the familiar left-hand 
rule. The force acting on an element of wire length is always perpen- 
dicular to the plane passing through the current and the magnetic 


Fig. 106 


induction vector at this location. To determine the sense of the 
force, note from which side the rotation of vector dl toward vec- 
tor B, through the smallest angle, appears counterclockwise. This. 
is the positive side in a right-hand screw system and the force vector 
then points toward the observer. The force has a maximum value 
when the current element is perpendicular to the vector field. When 
the wire element is parallel to the flux lines, the force is equal to zero. 

The above formulas are in the form used in physics, i.e., valid 
for the Gaussian system of units. In the form used in electrical 


engineering, the coefficient £ is absent-and the formula for the 


Ampère force is 
dF=T[dl, B]. 


~ 
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To determine the magnitude of the force acting on a piece of wire 
of finite length, one must integrate the above expression for the force: 
Pak f (dl, B]. 


c 


In the simple case of a rectilinear piece of wire of length 1, locat- 


ed in a uniform magnetic field B, Ampeére’s law may be directly 
applied in the form 


1 WAN 
F=— IIB sin GB. 


A perfectly natural relationship exists between Ampére’s law and 
the torque expression derived in the preceding article. We shall 
I consider only the simple 

case of a rectangular loop 
oriented parallel to the flux 
lines in a uniform magnetic 
field (Fig. 107). Two sides 
of the loop are perpendicu- 
lar and the other two sides 
are parallel to the flux lines. 
Therefore, the forces acting 
on the wire elements may 
be reduced to the two shown 


in Fig. 107. These forces are 
equal and in accordance with Ampeére’s law may be written in the form 


F = IIB. As can be seen from the figure, the Ampére forces create 
a torque N = JiBd. But since Id = S is the area of the loop, we 
obtain N = ISB = MB, which is the same as the formula for 


the torque derived in the preceding — e leave it to the 


Fig. 107 


reader to derive a more general proof. 


Example. The force acting on a conductor whose length is 3 metres and 
through which a current of 50 a flows in a field of 3,000 gausses = 0.3 volt 
sec/metre® is F=BI1—0.3 X 50 X 3 = 45 newtons ~ 4.5 kg. In the case of 
a rotor diameter of ~ 4 metre, the torque acting on the loop is ~ 45 newton 
metres. These values are of the order of magnitude of the parameters of a large 
electric motor. In an electrical measuring instrument, a force F = 2 X 10-5 new- 
ton = 2 dynes ~ 2 mg acts on a conductor of length 2 cm through which 
a current of 0.01 a flow. 


s in a field of 100 gausses. For a loop diameter of ~1 cm, 
a torque of ~2 X 40-7 newton metre acts on the loop. 


99. Force Acting on a Moving Charge 


We may go a step further and consider the magnetic forces acting 
‘on currents as forces applied to elemer 


ntary particles of electricity. 
Electric current is simply a flow of el 


f ectric charge. If e is the par- 
ticle charge, v the particle velocity and n the particle concentration 
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{i.e., the number per unit volume), then the expression for the cur- 
rent intensity may be expressed in the form J = nevS. Thus, all the 
particles in a volume vS pass in 1 sec through the wire cross-sec- 
tion S, i.e., the quantity of electricity flowing is equal to nevS 
(Fig. 108). Substituting this expression in the formula for Ampére’s 
law, we obtain: 


dF=Ż [vB] nS dl. 


But nS dlis the number of particles in the conductor volume under 
consideration. Thus, the force acting on one particle is 


f =< (vB). 


‘This force is sometimes called the Lorentz force—in honour of Lo- 
rentz, the distinguished physicist who contributed so much toward 
the development of the theory of electrons. 

The above expression for the force (we shall restrict ourselves to 


the form used in physics, i.e., with the coefficient 2) immediately 


Fig. 108 


Provides the answer to a number of very interesting questions regard- 
ing the nature of motion of electric particles (e.g., electrons and 
Protons) in a magnetic field. The force acting on a moving particle 
is perpendicular to the flux lines and to the particle velocity vector. 
if a particle moves parallel to the flux lines, no force is exerted on 
it. On the other hand, the force is a maximum when the motion 
occurs in a plane that is perpendicular to the flux lines. In this case, 


We obtain f = 1 evB. dicul 
If the field is uniform, an electric particle moving perpendicu ar 
to the field will Acho. a circle, for according to the fundamental 


aw of mechanics this is the nature of the motion under the action 


of a a right angles to the motion. We shall 
constant force directed at rig 8 in a magnetic field later 


return to the problem of particle motion i 
P. 445). 
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Example. Electrons ina cathode-ray tube accelerated by a potential differ- 
ence of 70 volts acquire a velocity of 5 x 108 cm/sec. Upon entering a magnet- 
ic feld of 500 gausses at right angles to the field, each electron experiences 


a Lorentz deflecting force of f = wi evB = 4 X 14071 dyne. Under the action 
c 


of this force, the` electron begins to move in a circular orbit of radius R, where 


2 
R is determined from the relation f = >. Hence, R = 5.6 cm. 


100. Magnetic Fields Created by Permanent Magnets 


Every permanent magnet has two poles.* The flux lines are direct- 
ed outwardly at the north pole and inwardly at the south pole. 
Imagine a surface constructed so that it encloses the north pole. 
We can then determine the total number of lines passing outwardly 
through this surface. By analogy with the corresponding electric 
quantity, this number is called the magnetic flux and is designated by 
the letter ©. The flux through an elemental area perpendicular 
to the flux lines is equal to dD = BdS , . Thus, through any arbitrary 
area, dD = BdS cos a, where a is the angle formed with the flux 
lines by the normal to the area; and through the surface S, ® = 


=\B cos a dS. Finally, through the closed surface, @® = 
= § Boos a dS. j 

The flux ®y, directed outwardly at the north pole and inwardly 
at the south pole, is the fundamental characteristic of a magnet. 
The stronger the magnet, the greater Oy. This somewhat justifies 


the designation “quantity of magnetism”—which is only of histori- 
cal significance—for the quantity proportional to the flux, namely, 


4 E y 
ra mL Sometimes m is called the magnetic mass, but this term 


is even less appropriate. In electrical engineering units, m = ©. 
If the poles of a magnet are small (e.g., in the case of a magnetic 

needle), the flux lines close to the poles are directed radially. 
Using the Gauss-Ostrogradsky theorem, 


§ D cosa dS = 4xq, 


we derived the formula for the electric induction of an isolated charge, 


namely, D = +. Clearly, an “isolated” magnetic pole should 
yield a magnetic induction satisfying the analogous equation: 


m . (2 è 
B=-z.: since $B cosa dS = 4am. 


* The creation of ma 


4 gnets with any number of pairs.of poles is also con- 
ceivable. 
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In the case of the electrical engineering formulas: 


3 
pS since $ B cosa dS =m. 
3 


Zarz? ~ 


To be sure, “isolated” magnetic poles do not exist. The above for- 
mula has meaning only for long magnets having point poles, and 


A 
Si das 


Fig. 109 


then only close to the poles. Nevertheless, this method of 
dealing with the pole of a permanent magnet is fully justi- 
fied in practice. This can be clearly demonstrated by means of the 
expression for the field of a bar magnet 
Considered as a magnetic dipole whose two 
Poles m are separated by a distance l. Fig. 109 
shows the field of a bar magnet and the ideal 
field based on the formula 


where r; and 2» are the distances from the poles 
to the point under consideration. The fields 
are seen to be very similar. 

Calculations yield -good results for the field 
at large distances from the magnet. Thus, if 
the distances 7, and rą are large relative to 
the magnet length J, the distance between 
the poles of the magnetic dipole, we are fully 
justified in considering the poles as points. Fig. 140 
The calculations are exactly the same as the 
Corresponding calculations for electric interactions. Let us compare, 
for example, the values of the magnetic induction created by a bar 
Magnet at a distant point along the magnet axis and at a distant 
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point perpendicular to the axis. In the first case, we obtain 


B m m 2ml 2M 
r2 (r+1)2 r3 o 7 


where M = ml is called the magnetic moment of the permanent 
magnet. In the second case (Fig. 410), 
M 


m 
B=2-7 cosw@=-] . 


Thus, the field along the axis is twice as large. 


In the electrical engineering system of units, the last two formulas 
assume, respectively, the following form: 


M M 
B=; and B=: 


Example. Let us calculate the ma 
of length Z = 10 cm at a distance 
the axis. The magnet cross 
net is 500 gausses. 

The magnetic flux in the magnet, which is the same as the outward flux 


from tne pole, is P = 500 x 3 = 1,500 maxwells. Thus, a “magnetic mass’” 


gnetic induction created by a bar magnet 
r = 1 metre from the magnet, measured along 
-section is S = 3 cm? and the induction in the mag- 


Tae 120 Gaussian units is concentrated at the magnet pole. The mag- 
netic moment of the magnet is 


M=ml=120 x 10 =1,200 ergs/gauss (Gaussian units), 
Then, for the magnetic induction, we obtain 


2M _ 2x1,200 7 
= 3 = 4 40-3 
B= = = a0 =2.4 10-3 gauss 


101. Magnetic Field Intensity 
Let us consider the interaction of an 


a current element (Fig. 114). The magnetic pole creates a field B at 
the location of the electric current. Therefore, in accordance with 
Ampére’s law, the force acting on the current element is 


dF=+T{dl, B). i 


isolated magnetic pole and 


In place of the magnetic induction, we can 


for a point pole. Since the field is direct 
obtain the follow 


substitute the expression 
À ed along the radius, we 
Ing expression for the interaction force: 
=, m Tr 
dr=— T| dt, ~ 
or 


mI 


AN 
dF=-z dlx sindi, r. 
er 
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It is quite natural to assume that the force with which a current. 
element acts on a magnetic pole is represented by the same formula, 
except that the direction of the force is reversed. This assumption 
cannot be directly verified experimentally since an isolated pole and 
an isolated element of constant current do not exist. However, we 
can verify the validity of the above statement by integrating the 
interaction forces in actual cases. It turns 
out that theory and experiment agree. 

Thus, the force exerted by a current ele- 
ment on a magnetic pole may be written in 
the form 

dr=—1[at, =| 


ere 
or, inthe electrical engineering system, i.e., 
. s £ e 
without the coefficient — and replacing m 


ar=72,1[at, =]. Fig. 111 

A minus sign does not appear in this formula because a reversed 
radius vector has been assumed. The direction of 7 is always taken 
as the direction from the field source to the point of observation. 
herefore, in the case of the force acting on the current, 2 was as- 
sumed to be directed from the pole to the current element. Now, when 
the force is exerted by the current on the pole, the radius vector 2 
is assumed to be directed from the current element to the pole. 
he force acting on a unit magnetic pole is called the magnetic 


field intensity: 


m 


Thus, our discussion has shown that the magnetic field intensity 
Created by a current element is given by the formula: 


I r 
dH=—,| dl, =]. 
In the electrical engineering system_of units, this formula for the 
Magnetic field intensity created by,a current has the form 
J a 
JHU- E [ a, z] À 


4nr? r 
Thus, a magnetic field may be characterised in two different ways, 
Namely, by the induction vector and the intensity vector. The for- 
Mer measures the action of the magnetic field on a current and the 
atter measures the action of the field on a magnet. 
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In practice, it is easier to reduce the measurement of intensity to 
the measurement of the torque acting on a magnetic needle (Fig. 112). 
Such a needle located in a uniform field is subject to the action of 
a couple of forces, where the magnitude of the force is equal to mH 


and the arm of the couple is equal to J sin æ. Hence, for the torque, 
we obtain the expression 


N=MAZ sina 


In ‘vectorial form IV = [MH], where M = ml is the magnetic 
moment of the needle. It is seen that this formula is very similar to 
that for the torque acting on a cur- 
rent-carrying circuit. 

The relationship existing between 
the magnetic field intensity and 
the magnetic induction can be de- 
termined experimentally. It turns 
out that in all cases, except in the 
case of anisotropic bodies, the inten- 
sity and induction vectors are par- 
allel to each other. This means 
that the magnetic needle and the 

Fig. 112 axis of the test circuit are always 

: parallel. Moreover, in all cases, 

except in the case of ferromagnet- 

ic substances, a simple linear relationship exists between A and B, 

namely B = uou H, where Ho is a universal constant (the so-called 

magnetic permeability of vacuum) and u isa coefficient characterising 
the medium (the relative magnetic permeability of the medium). 

In the system of units used in physics, uo = 1. This yields the 
same dimensions for the magnetic induction and the intensity. 
The price that must be paid for this identity, however, is the intro- 


duction of the dimensional coefficient + in the formula for Ampére’s 


law. In the system of units used in electrical engineering, the mag- 
netic permeability of vacuum is 


Wo = 4 x 10-7 joule/a2 metre, 


102. Interactions of Currents and Magnets 


The relations considered in the preceding sections enable us, in 
principle to calculate the interactions of any magnetic system. 
We have at our disposal formulas for the forces and torques acting 
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on devices by a magnetic field of any origin: 


Action on a current 


Action on a magnet 


Physics formulas Elec. eng. formulas 

i P 
F=- [dl, B] F=] (dt, B] F=mH 
N=[MB], N = [MB], N =[MH], 
where M=+ IS where M=IS where M=ml 


Formulas relating fields to their sources: 


Field due to current Field due to magnet 
Physics formulas Blec. eng. formulas panes Eleos cue 
dH= r : I [ r EZ Em 
cre [az w. dH = Tar au =| B=a B= Tan 
Bany s __2M __M 
l B= uou H B= B=- 


Substituting any of the lower formulas in any of the upper ones 
and using the relation B = powH, we obtain the formulas for 
magnetic, electromagnetic, magnetoelectric and electrodynamic inter- 
action. We shall illustrate each type of interaction by an example. 

agnetic interaction, i.e., the action of one magnet on another. 
Two poles separated by a distance r interact in accordance with 
Oulomb’s law, i.e., 
mmo 


mmo a 
F= ur? or F 4npour? 


The interaction force is inversely proportional to the magnetic per- 
meability. 

Electromagnetic action, i.e., 
ià Current element exerts a torque on a mag 
ity, we assume that M L H, i.e., the magne 
ular to the flux lines. Then, 

’ AAS 
INN sindl, 7 or 


er2 


the action of a current on a magnet. 
netic needle. For simplic- 
tic needle is perpendic- 


YN 
MI . ° 
dN = nra dl sin dl, 7. 


t8—1409 


274 Magnetic Fields 


The interaction does not depend on the magnetic permeability, 
i.e., on the properties of the medium. 
Magnetoelectric action, i.e., the action of a ‘magnet on a current. 
Consider a current-carrying circuit located along the extension of 
i the bar magnet axis at a distance r 
from the magnet (Fig. 113). The 
torque acting on the circuit is 


MeurMma g 


7 N=M uB sina = 7a 


sina 
4 or 


r — M.M; 
4 i c#im 
Me x N Bars Sing 


a Thus, the interaction does not de- 
pend on the magnetic permeability. 


8 Example. A circuit of area S=20 cm?, 
k through which a current J = 10 a flows, 
Fig. 113 interacts at a distance of 100 cm with 


i 4 ; a bar magnet whose magnetic moment 
is. Minag = 1,000 Gaussian units. The torque acting on the circuit is 


v= Sirie . 
r 
4 ; 3 
Meur =3x 1010 X 10 x 20 x 3 x 109 = 20 Gaussian units and 
N=4 10-2 dyne cm. 


Electrodynamic action, i.e., the action of one current on another 
current. Two parallel currents are attracted with a force 


I I 
dF =—* dl,B, where B= juni = Holt —F dlo, 
iE, 


dF = p Like dl; dle 


ee IT dl; dlg 
Tap 


4nr2 


or dF = pop 


The interaction is directly proportion 
The formulas for the interaction 
derived in exactly the same manner. 


al to the magnetic permeability. 
of magnetic systems may be 


_ «ample. It is essential to take into account electrodynamic interaction 
in the laying of bus bars. If a short circuit should occur, the bus bars and their 
Supporting insulators must be sufficiently firm to withstand large electrodynam- 
ic forces. Assume that a current Ta = Ip = 3 X 104 a flows in parallel bus 
bars separated by a distance d = 20 cm. A force F = BI = oT acts on 
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; Tar a $ 3 
a unit length of each bus bar, where Hra is the magnetic field intensity 
ecenied by the linear current flowing in the other bus bar (see p. 279). 
lus, 
Mol? _ án x 1077 x 9 x 408 


ond x02 900 newtons, 


i.e., a force of ~ 90 kg acts on each metre of busbar. The same result could 
be obtained by integrating the last formula for dF above. 


103. Equivalence of Currents and Magnets 


We have called attention to the fact that a similarity exists be- 
tween the expressions for the torque acting on a magnetic needle and 
the torque acting on a current-carrying circuit. As a matter of fact, 
these two systems behave extremely alike in an external field. If 
each of the systems is characterised by its magnetic moment vector, 
the likeness appears even greater. Both systems tend to become ori- 
ented in the magnetic field with the magnetic moments parallel to 
the flux lines of the field. If the magnetic moment is displaced from 
the position of stable equilibrium, a torque N = [MH] acts on the 
System in the case of the magnetic needle and a torque N = [MB] 
ìn the case of a current-carrying circuit. The potential energies of 
these systems are represented by the formulas U = — MHL and = 
= —M B, respectively. 

If we recall that B = jtouH, the difference between the two for- 
mulas becomes immediately evident, i.e., they differ only with 
respect to the magnetic permeability factor. Hence, as regards 


mechanical action, a magnetic needle of moment M is equivalent 
M 


to a current-carrying circuit of moment M,.= ne 

However, the analogy between these two systems goes even fur- 
ther, We shall now show that the proper fields of a magnetic needle 
and a current-carrying circuit are alike except for a constant factor. 

his similarity occurs at distances considerably greater than the 
dimensions of the system. We shall prove this for a point in space 
in line with the magnetic moment at a distance 7 from the centre-of 
the system. The field of a magnet for such a point has already been 


2M $ 
Calculated and found to be equal to B=—, - It remains to deter- 


Mine the field of a circular loop of current along the axis of the loop. 
In Fig. 114 are shown the vectors of the field intensities due to two 
are elements of current directed into and out of the page, respec- 
tively. The field intensity vectors are perpendicular to their corre- 
Sponding current elements and radius vectors, 1.e., they are in the 
Plane of the page. The sense of the intensity vectors is determined. 
18* 
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from the vector product rule or, what amounts to the same, the 
right-hand screw rule. . 

Since a current element and its radius vector are perpendicular to 
each other, the field created by such an element is equal in this case 


figure. For the field created 
by the “antipodes”, we obtain 


2I dl 


cr2 


aS 


cosĝ, 


where the meaning of the va- 
rious designations is evident 
from the figure. This value of 
field is yielded by every such 
pair of “antipodes”. Therefore, 
the resultant field is obtained 
from the last expression by 
replacing dl, the length of an 
Fig. 114 element, by xa, half the circum- 


ference of a circle. Thus, 
the field intensity along the axis of a circular current, at a dis- 
tance r from the current,* is given by the formula 


2 
jie aes 
ere 


Since AS is the moment of [a circular current (here S = xa’), 
we obtain: Hat and B = upo ay 9 

Thus, we have proved that a magnetic dipole and a current-carry- 
ing circuit are not only equivalent as regards the forces acting on 
them, but also as regards the fields created by them. The nature of 
the equivalence is the same in both cases. A magnetic needle of 


moment M can be replaced by a current-carrying circuit of moment 


In vacuum and for the system of units used in physics, i.e., for 
Huo = 1, the equivalence principle becomes even simpler. In this 
case, a magnetic needle of moment M is equivalent to a current- 
carrying circuit having the same moment. 


* Since we are dealing with large distances, the difference between r and 
the distance to the centre of the system is negligible. 
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Examples. 1. Let us return to the example on p. 270 and calculate the mag- 
netic induction of the same magnet in the practical system of units: 


B=0.05 volt sec/metre?, S=3 x 10-4 metre?, 
@=15 x 10-8 volt sec, m=15X 1076 volt sec, and 
l=0.1 metre, and M=ml=15 X 10-6 x 0.1= 1.5 x 1076 volt sec metre. 


Hence, 


=2.4% 10-7 volt sec/m?, 


which is in complete agreement with the result obtained on p. 270. 
2. Consider a current-carrying circuit for which 7 = 5 a and S = 2 cm?. 
The magnetic field intensity created at a distance r = 50 cm (along the axis 
A k * 2M 
of the circuit and perpendicular to its plane) is H = -57 


{2 TS =——— 5K 8 KINE s 
M= IS 3x100 5x3 x 10X 1 erg/gauss 


and H =1.6 x 10-5 oersted. 


104. Rotational Nature of a Magnetic Field 


_A study of the nature of magnetic lines shows that magnetic lines 
differ basically from electric fields. Electric lines have a beginning 
and an end, i.e., there are no closed lines in a constant electric field. 
On the other hand, experiments show that magnetic flux lines, i. e., 
vector lines of magnetic induction, are always closed. Ii other words, 
such lines have neither beginning nor end. ‘ 

For reasons discussed above, force fields in which the work along 
a closed path is equal to zero are known as potential fields. Vector 
fields characterised by closed flux lines are known as rotational 
fields. A magnetic field is a rotational field. — 

If we describe a closed surface in a magnetic field, the net outward 
flux P = Ê B cosadS through such a surface will always be equal 
to zero. In other words, the number of lines entering this surface 
is equal to the number of lines leaving it. Thus, the equation $ B 
cos «dS = 0 is the mathematical expression of the fact that magnet- 


ic flux lines have neither beginning nor end. 
cle the currents creating the field. 


The magnetic lines always encir € 2 l 
Therefore, the integrals taken along induction or intensity flux 
lines, i. e., $ Bdl and $ H dl, respectively, differ from zero. It is 
more convenient to consider the second integral, for its value is 
Proportional to the magnitude of the electric current encircled by 
flux lines. This can be seen from the basic field intensity formula, 
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‘which shows that H and current strength are directly proportional 
to each other. 


By analogy with electrostatics, \ H dl is called the magnetomotive 
force (mmf). If the integral is taken along a flux line, then 


f H dl= f Hal. 


The magnetomotive force along a closed curve is proportional to 
the current encircled by this curve: , 


Ê H dl=kl, 


where k is a coefficient of proportionality. 
A flux line may encircle more than a single current. Then, using 
the algebraic sum of the currents, the equation assumes the form 


fHdl=k DI. 


Deeper theoretical analysis, which we are unable to go into here, 
shows that the above equation is subject to,two more generalisations. 
First, the integral need not be taken along a flux line, but can be 
taken along any arbitrary circuit. Secondly, the coefficient of 
proportionality in the equation is a constant depending only on 
the properties of the medium and is the same for all geometric 
conditions. Thus, the magnetomotive force is the same for any 
closed curve that encircles a current of specified strength. The shape 
of the curve and its length are of no significance. Tt is immaterial 
whether the curve encircles one current or ten currents and whether 
these currents are rectilinear or curved. As long &S the algebraic 
sum of the currents passing through the closed curve remains the 
same, so does the magnetomotive force. 

Since the coefficient of proportionality in the formula for the mag- 
netomotive force is a universal constant, we can determine & if the 


magnetomotive force can be calculated for any system whose field 
is known. 


We are familiar with the general ex 


intensity of a current element, but mathematical difficulties are 


encountered in calculating the magnetomotive force by means 
of the formula for the field intensity, namely, 


I r 
H= zra, >]. 


pression for the magnetic field 


However, we are also familiar with the formula for the magnetic 


field intensity along the axis of a circular current, namely, H=% : 


tari 
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No special difficulties are encountered in calculating the magnetomo- 
tive force along this axis. We should not be disturbed by the fact that 
the integration is carried out along a straight line, while we are 
interested in the magnetomotive force along a closed curve. As 
a matter of fact, a straight line extending from minus infinity to 
plus infinity is a closed curve, for it is closed at infinity. The expres- 


sion for the magnetomotive force, N H dl, along such a closed curve, 


, A d 3 iSi 
i.e., along the axis of a circular current from minus infinity to 
plus infinity, may be written in the form 

+0 


2M $ 


=% 


dl 
(Vee) 
where a is the radius and Z is the distance measured along the axis 
of the circuit. The integral is easily evaluated by using the new var- 
iable B, defined by the formula Z = cot B. It turns out to be equal 


2 . 
to—>. Substituting 1 Ing for M and equating the value of the 


magnetomotive force to kI, we obtain 


ee (in the Gaussian system of units) 
c 


k=4 (in the electrical engineering system of units). 


The magnetomotive force relation now assumes the form 


§ wdt=== ST or § wr at= >I. 


The magnetomotive force relation is very useful in determining 
the magnetic fields of various systems. Its application is facilitated 
by considerations of symmetry and in this respect the following 
discussion is quite analogous to the discussion relating to the solu- 
tion of the corresponding problems in electrostatics by means of 
the Gauss-Ostrogradsky theorem. Si 

Let us first consider an infinitely long rectilinear current. From 
considerations of symmetry it is evident that the flux lines must be 
circles whose centres lie on the wire axis. It is similarly evident that 
at all points on such a circle the numerical value of the intensity is 
the same. Applying the magnetomotive force relation to such a flux 


line, we obtain: Hfdl= dg Here, fal is simply the length of 
a flux line. If the points under consideration are located at a distance 
Onur. Thus, for the magnetic field 


r from the wire axis, then $ dl = 
nt in the region outside the 


of an infinitely long rectilinear curre 


Ra a 
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wire, we obtain 


H=2 (in the Gaussian system of units) 


H= (in the electrical engineering system of units). 


Let us now determine the magnetic field intensity inside the wire. 
Assume that the radius of the wire is designated by a and that the 
current density is uniform over the wire cross-section. Hence, the 
flux lines within the wire must also be circular. Consider such a flux 


line of radius r. The current flowing through it will pata. There- 


fore, the magnetomotive force rela- 
H tion yields: 
2 
(PSB E ee 
c a? 
A cet 
c ae 
T In the electrical engineering sys- 
B tem of units 
I 
Barak 
Thus, we see that the magnetic field 
Fig. 145 2 paat k 


intensity along the wire axis is 
equal to zero. The intensity in- 
creases with radius and becomes a maximum at the surface of the 
wire. Then, for the region outside the wire, the field intensity de- 


creases, being inversely proportional to the distance from the axis 
(Fig. 115). : 


If the field is determined at a point for which the distance r is much 


less than the distance to the end of a wire, then the formula 7 =—_ 
is also valid for a wire of finite dimensions. 


Example. Let us calculate the magnetic field intensity at a distance of 5 cm 
from the axis of a rectilinear current of 20 a. 


In Gaussian units (I = 20 x 3 x 10° = 6 x 1010 statamps) 


— 215 2% 6x 1010 is 
Perseus = 0-8 oersted 


In practical units (7 = 20 a and r= 0.05 metre): 


22 a De Lis ins 
a 2ar 2X 0.05 ero j 
Another important example of the a 
motive force relation is in the calculati 


pplication of the magneto- 
on of the field of a solenoid. 
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Consider a uniformly wound toroidal solenoid whose circumferen- 
tial length is L. The field within the solenoid is uniform and all the 
flux lines are concentric with L. This system plays the same role in 
magnetic field theory as a parallel-plate condenser of infinite extent 
in electric field theory. Each flux line envelops all n turns. Therefore, 
the magnetomotive force along a flux line of length ZL is 


§ Hal= ni. 


Since § Hal = HL, we obtain 


a 
H==— I (Gaussian system) 


H =71 (electrical engineering system). 


The magnetic field intensity of the coil is determined by its “am pere- 
turns”, i.e., the product of the current strength and the number of 
turns per unit length of the solenoid. It should be noted that the sim- 
Plicity of the last formula is one of the justifications for the electrical 
engineering system of expressing the field equations. Since a solenoid 
is one of the basic elements of electrical engineering devices, simpli- 
fication of the formula for the calculation of its magnetic field inten- 
sity is of great practical significance. 

The formula H = ZI is also valid for a straight solenoid if used 


to determine the field within the solenoid at points sufficiently far 
away from the edges. 


_Example. The magnetic field intensity at the centre of a long, thin sole- 
noid, where ZŁ = 15 cm, n = 1,500 turns and J = 0.4 a, is 


amps 


H =1,000 metras 
In the Gaussian system: 
4n n án 1500 (6 4 563 > 109) = 12.56 oersted; 
fer aioe 1 OR 
amps 
4P _ 4n x 10-3 oersted and 4 oersted ~ 80 Saat 


metre 


105. Law of Electromagnetic Induction and Lorentz Force 


ctromagnetic induction | discovered by 
Faraday, the great English physicist, may be described as follows: 

n electric current is induced in a closed conductor loop if the value 
of the magnetic flux passing through the loop changes. Moreover, the 


The phenomenon of ele 
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inducted emf is proportional to the rate of change of the magnetic 
flux, i.e., the derivative with respect to time 


aoe where O= Ê B cosa ds. 
dt A) 


We shall show that the law of electromagnetic induction is closely 
related to the existence of a Lorentz force. If the electromagnetic 
induction is due to the displace- 
ment of a wire inamagnetic field, then 
the law of induction follows from the 
expression for the Lorentz force. 

In order to avoid confusion due to 
difficulties of a purely mathematical 
nature, let us simplify the proof by 
assuming that the induced emf arises 
in a rectangular circuit oriented per- 
pendicular to the flux lines ina uni- 
form magnetic field. A change in flux 
produces a translatory displacement of 
one side of the rectangle of length 7 
as shown in Fig. 116. Since there are 
free charges in the displaced con- 


Fig. 116 ductor, these charges are subjected 
to the action of a Lorentz force 
j= ŽB when the conductor moves with a velocity v. In view 


of the fact that the velocity, magnetic field and conductor are 
at right angles to each other, we are able to dispense with the vector 
notation in the formula for the force since the sine of the angle is 
equal to unity. The Lorentz force is directed perpendicular to the 
plane passing through the direction of the velocity v of the charges, 
and hence of the wire, and the direction of the magnetic flux lines. 
Thus, the force is directed along the wire. The charges are impelled 
to move and an induced current is thereby created. 

The electromotive force is equal to the work of moving a unit 
charge around a closed circuit. Since the force acting on a unit 


è 1 
charge is equal to = vB, the work of this force along the moving 


fe 1 

wire is equal to -y YBL. Moreover, no work is performed along the rest 
of the circuit. Hence, the last expression is the expression sought for 
the induced emf. It has the form: 


g = Bl (in the Gaussian system of units) 


6? =vBl (in the electrical engineering system). 
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_ Assume that the wire moves a distance dx during the time dt. 
Thus, the area of the circuit increases by the amount ldx = dS and 


the magnetic flux by the amount dD = BdS. Since v= i, the 


induced emf may also be written in the following form: 2 . But 
a k i 3 , dt 
this is precisely the expression for Faraday’s law of electromagnetic 


. = e i CAUS ~ á F: 
induction, i.e., guu a the Gaussian system of units and 
gind da, 3 p F 

oia in the electrical engineering system. 


dt 
We have thus demonstrated that electromagnetic induction and 


the deflection of moving electric charges in an external field repre- 
sent different aspects of one and the same law of nature. In the 
next chapter, we shall return to this interesting problem. Here, 
it was only necessary to present the essence of the electromagnetic 
induction law. 


106. Measurement of Magnetic Fields by Means of 
Induced Impulses 


The phenomznon of electromagnetic induction is utilised in the 
design of precision instruments for the measurement of magnetic 
fields. Assume that it is necessary to deter- 
mine the value of the magnetic field at some I 
Point in space. A’ small, flat coil or a single 
turn of wire is placed in a magnetic field per- 

Pendicular to the flux lines and the ends of 

the winding are connected by means of leads 

to the terminals of a ballistic galvanometer. 

Now, if the coil is rapidly turned through 

a 90° angle in such a manner that its flat 

Surface becomes parallel to the flow lines, an 

electrically induced current flows in the wind- t 
ing as the coil is being turned. The brief Fig. 147 

llow of current, which rapidly reaches a max- 

imum and then drops to zero, is called an 

induced impulse (Fig. 117). During this brief interval, a certain 
Quantity of electricity flows in the wire. The charge can be very 
accurately measured by means of a ballistic galvanometer, a de- 
Vice having a moving coil with a high moment of inertia that 
integrates the electric current over the period of the impulse. 

_ If the resistance of the coil is R and the number of turns n, the 
induced current strength may be written in the form 


n dO 
UER 


a -y 
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The quantity of electricity flowing in the wire during the period of 
the induced impulse is 


T 2 
Q= \ Idt=R"n f dD = Rn (D,—O;), 
0 4 


where @,; is the value of the flux passing through the coil in the 
first position and ®, is the value in the second position. 

If P; or ®, is equal to zero, i.e., magnetic flux lines do not pass 
through the coil in the initial or final position, the performed meas- 
urement yields the value of the magnetic induction. For this purpose, 
we need only divide the value of the magnetic flux by the coil area S, 
E QR 
ie, B= Ae 

Of course, other methods of measurement are also possible. Thus, 
instead of rotating the coil, the field may be switched on or off. Also, 
to make the effect more pronounced, the coil can be turned through 
an angle of 180° instead of 90°. This doubles the effect. Similarly 
the polarity of the field can be reversed, rather than simply switched 
on or off. 

Since the measuring coil may be made as small as 
metre, measurements by this method enable us to ac 
mine the magnetic field in small regions. r 

This method is also used to measure magnetomotive force. For 
this purpose, we use a measuring device known as a Rogovsky belt— 
a long coil on a flexible belt. The belt m ay be shaped into any desired 
form and its two ends placed at any two points in a given region. 
Also, if desired, the ends of the belt may be brought into contact 
with each other. We shall show that when the field is switched off 
the deflection of a ballistic galvanometer connected to such a meas- 
uring belt is proportional to the magnetomotive force along the path 
of the flexible belt. 

The deflection of the ballistic g 
quantity of magnetic flux passing 
Let n be the winding density, 


a square milli- 
curately deter- 


alvanometer is a measure of the 
through all the turns of the coil- 
i.e., the number of turns per unit length 

of the measuring belt. Then, on a small belt segment Al;, there are 
nAl; turns, and the magnetic flux passing through these nAl; turns 
is equal to D;nAl;. 

If the medium is homogeneous and all the turns have the same 
area, then 

O;=pSH,, 


gnetic flux passing through the entire measuring 


and the total ma 
belt is 


p= x wSnHAl; 
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Taking the limit for Al; — 0, we obtain 


O= MS \ Hdl. 


Heo 


Since the measurement occurs in a medium for which u does not 
significantly differ from po, the quantity uSn is a constant of the 
instrument. The throw of the ballistic galvanometer in these belt 
measurements is exactly proportional to the magnetomotive force 
between the points at which the ends of the belt are located. 

By means of this device, it is easy to demonstrate the validity of 
the laws discussed in Sec. 104. Thus, as long as the coil encircles 
one and the same current, the magnetomotive force will remain the 
same for all configurations. Also, it is easily verified that the mag- 
netomotive force along a circuit not encircling current is equal to 
zero. For the case when the coil encircles a current several times, the 
magnetomotive force can be shown to increase by the corresponding 
number of times, etc. 

It should be emphasised that magnetic field measurements by 
means of induced impulses are of particular importance when we are 
Concerned with the magnetic field inside a solid body. The only other 
method available is to make a slit in the solid body. Such a proce- 
dure is usually not possible. 

Let us consider the most common problem—the measurement of 
the magnetic permeability of an iron body. The most accurate results 
are obtained when the substance under investigation is in the form 
of a toroid. Two windings are wound on such a ring—one connected 
to a current source and the other to a ballistic galvanometer. If 
Current is flowing, a magnetic flux P = BS passes through the 
ring. By reversing the direction of the current through the primary 
Winding, an induced current is produced in the secondary. The 
quantity of electricity Q flowing through the galvanometer is related 
to the magnetic induction inside the ring by a relation already 


Iscussed, namely: 
R 
BS 2 7 


Where § is the cross-section of the toroid (assuming the turns are 
Wound right on the surface of the ring), m2 is the number of secondary 
turns and R ‘is the resistance of the secondary winding. As regards 

he magnetic field intensity, it may be determined by the formula 


for a ring solenoid, namely, H = ie, The magnetic permeability of 
the substance under investigation is then equal to B divided by H. 
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107. Finite Bodies in a Magnetic Field 


To one degree or another all bodies possess magnetic properties. 
These are indicated, first, by the fact that a magnetic field exerts 
forces and torques on bodies and, secondly, by the fact that a body 
placed in a magnetic field distorts the field. As was indicated above, 
the magnetic properties of a substance are characterised by the coef- 
ficient p—the magnetic permeability of the substance. In accordance 
with the value of u, bodies may be divided into three distinct classes: 
ferromagnetic substances—including iron, nickel and cobalt—whose 

relative permeabilities are much greater than unity; paramagnetic 
substances, whose permeabilities are somewhat greater than unity; 
and diamagnetic substances, whose permeabilities are slightly less 
than unity. Typical values are given in the table. 


Substance u kad 
GOPP aTom aE r o a om ar 0.999990 —10-5 
WALETA A E E ae 0.999991 —9x 10-6 
el etanuriaes tege pee teers oe 4.000300 300 1078 
LLC On sagen nre Ee en gence me ny ee 0.999986 —14x 10-8 
UN SCO eye ue eR 1.000079 79x 10-8 


When a diamagnetic or paramagnetic body is placed in amagnetic 
field, the distortion of the field is negligible. On the other hand, when 
a ferromagnetic body is placed in the field, there is considerable 
distortion. > 

The forces exerted by magnetic fields may be detected without 
particular difficulty in the case of paramagnetic and diamagnetic 
bodies. In the case of iron objects, everybody is familiar with the 
fact that magnetic fields exert large forces. 

Let us first consider magnetic forces. A body that does not possess 
magnetic properties becomes magnetic when placed in a field. This 
magnetisation process manifests itself in the acquisition of a mag- 
netic moment by the body. As we know, a system possessing a magnet- 
ic moment may be detected'in two ways. In a uniform field, the 
body tends to become oriented in such a manner that the direction 
of the moment is parallel to the external field. Moreover, in a nonu- 
niform field the body will experience a force tending to move it 
along the lines of force. 

In the case of ferromagnetic bodies 
without difficulty. The magnetic mo 
determined from the formula N 


, the torque may be detected 
ment of the body may then be 
= [MH]. However, we are not 
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usually interested in a body having a particular shape, but are 
interested rather in the substance as such. Therefore, when possible 
the measured value is recalculated on the basis of unit volume. The 
vector directed along the magnetic moment and numerically equal to 
the magnitude of the magnetic moment per unit volume is called the 
magnetisation vector J. Of course, the magnetisation vector can be 
determined without difficulty from the magnetic moment of a body 
only if we are sure that the magnetisation of the sample is uniform. 
This is the case when the sample is in the form of an ellipsoid or 
a degenerate ellipsoid, i.e., a cylinder, plate or sphere (see p. 261). 
That is why such bodies are used in experiments of this kind. 

Determination of the magnetisation vector by measuring the torque 
is easily accomplished for ferromagnetic bodies. Since the torque 
is very small for paramagnetic and diamagnetic bodies, this meas- 
urement is difficult to perform. It is therefore preferable in these 
Cases to measure the forces acting on a body located in a nonuni- 
form field. 

Let us consider a small volume of a magnetic substance located in 
a nonuniform field. For simplicity, assume that the field varies 


É ; aH ; 
along one axis and that the gradient is equal to =. Since a small 


volume of magnetic substance behaves like a magnetic dipole, the 
Potential energy of a unit volume may be written in the form U = 
= —J#H. If the moment acts along the field, the force exerted on 
a unit volume of the magnetic substance is equal to the derivative 
of the potential energy with respect to the coordinate, i.e., 


eS 


Thus, if we know the field gradient, we can determine the magnetic 
moment of a unit volume of the body under investigation by meas- 
uring the force. In practice, there are various ways of accomplishing 
this, the simplest being by means of a so-called magnetic balance. 

thread is passed through an aperture made in one of the pans of an 
analytical microbalance. Then, the sample is attached to the end of 
the thread and suspended between the poles of a magnet. Before and 
after energising the magnet, the sample is balanced; hence, the 

ifference between the readings is equal to the value of the force f. 

The weights must be quite accurate as can be seen from the fol- 
Owing example. A piece of bismuth, a substance whose EE Tenet ke 
Properties are most pronounced, has a magnetisation J of 2 x 10 
Gaussian unit when placed in a magnetic field whose intensity H 
is ~ 41,000 oersteds. If the nonuniformity of the magnetic field is 
=a 50 oersteds/em, a force of only 1 cm will be exerted on each 

f~ 1 dyne/cm*. 


Cubic centimetre of bismuth, i.e., 


-u 
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Experiments show that for diamagnetic and paramagnetic bodies 
the following simple relationship exists between the magnetisation 
vector and the magnetic field intensity: 


J =uxH, 


where x is the magnetic susceptibility. For diamagnetic bodies x 
is negative, while for paramagnetic bodies it is positive. Values of x 


DE, |: pag: 


xE 


a) Diamagnetic body b) Paramagnetic body 


Fig. 118 


are given in the table on p. 286. When x is positive the magnetisation 
vector is in the direction of the field intensity vector, but when x is 
negative, i.e., for diamagnetic bodies, the magneétisation vector 
is opposite to the field. 

Due to this difference in sign, the behaviours of these two classes 
of bodies under identical conditions are completely different. This 
is illustrated in Fig. 118. As can be seen, the differences are quite 
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striking. A paramagnetic body is attracted toward the region of 
strong-field, while a diamagnetic body is repelled from such a region. 
In a uniform field, a paramagnetic needle tends to become oriented 
with its axis along the flux lines, while a diamagnetic needle tends 
to become oriented perpendicular to the flux lines (see the analo- 
gous example in the case of a dielectric, p. 256). 

The determination of magnetic susceptibility by measuring the 
force in a nonuniform field may be accomplished for solid bodies in 
the form of a monocrystal or powder. This method is also easily 
adapted to liquids. In this case, the experiment can be so arranged 
that the measured quantity is the increase or decrease in the level 
of the liquid as it is attracted toward, or repelled from, the region 
between the poles of a magnet. 


108. Relationship Between Permeability 
and Susceptibility 


Both permeability and susceptibility may be determined straight. 
forwardly. The permeability is determined from the formula u = >; 


by measuring the induction and field intensity. The ‘susceptibility 
is determined, as described in the preceding article, from the forces 
exerted on a magnetic substance. To be sure, the relationship between 
these two characteristics of the magnetic properties of a substance 
can be established experimentally. However, there is no need to do 
this since an exact and simple relationship exists between p and x. 
This will now be shown. ‘te 7 

Let us return to the experiment for determining the magnetic 
Permeability of a body in the form of a toroid. The primary winding 
Creates a field of intensity H = g , which is independent of the sub- 
if the toroid were not present, the field 
intensity would be given by the same formula. The situation is 
different as regards the magnetic flux. It can be shown experimental- 
ly that the value of B depends on the magnetic permeability. If 
an iron core is placed in the coil, B becomes hundreds or thousands 
of times greater than in the case of an air core. This increase in mag- 
hetic flux is due to the magnetisation effect. 1 i 

First, it should be noted that the magnetic induction of a ring 
solenoid without an iron core (tof) has the significance of magnet- 
l¢ mom er unit volume. ' ay. 

The news eed of one turn is equal to ZS, for in this discus- 
Sion we shall employ the practical system of units. For the total 
Magnetic moment of the system we obtain nIS, and the magnetic 


Moment in unit volume, es is simply equal to the field intensity. 


Stance of the toroid, i.e., 


19-1409 


eh | 
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The magnetic moment of the equivalent dipoles is yọ times greater 
(cf. Sec. 103). Therefore, the magnetic induction oH of a uniform 
magnetic field created by the turns of a ring solenoid without an 
iron core can be expressed as the magnetic moment of the equiva- 
lent dipoles per unit volume. 

We are entirely justified in assuming that the significance of mag- 
netic induction is maintained if, without disturbing the uniformity 
of the field, the coil is filled with an additional number of magnetic 
dipoles. If J is the magnetic moment per unit volume due to the 
additional dipoles, the magnetic induction increases by this amount 
and becomes 


B=pH-+J. 
Such an increase in B also occurs when the solenoid is filled with 


a magnetic substance. Since J=poxH, then B=po (x + 1)H; hence, 
the susceptibility and permeability are related by the equation 


p=1+x. 


Analogous calculations employing the Gaussian system of units 
lead to formulas with other coefficients. The magnetic moment of 
currents (and dipoles) per unit volume is 


4 
n—Is n 


S a e 


Therefore, in the presence of a medium, 

4 4 3 

m B= Et. ie., B=H+4nJ. 
Letting J = wH, we obtain 


B=(1-+ 45x’) H. 
Hence, 


p=1+4nx', where w =-=. 
ATU 


Example. Let us perform the calculation in the example on p. 287 employing 
the practical system of units. For bismuth, x’ = 2 x 1075, i.e.,x% = 4nw = 
= 8x X 107°. The piece of bismuth is in a magnetic field whose intensity is 


4 
H = 1,000 oersteds=—— x 10° x 1,000 2@Ps _ 10° amps 
AT metre 4x metre’ 


Furthermore, the nonuniformity is expressed by 


dH 50 oersteds =50x zs x103? x 1400 2™Ps_ _ 5408 amps ý 
dx cm án metre? 4n metre? 
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0-6 volt sec 


metre? ` Hence: 


The bismuth magnetisation is given by J = puoxH =81X1 
the force acting on a unit volume (1 metre’) is 


= U ? -e 5X108 _ n newtons 
faJ a O xis =H metres * 


newtons _ , dyne 
metres ~~ em 
the previous example. 


Clearly, 10 , Which agrees with the result obtained in 


109. Distortion of a Magnetic Field Due to the Presence 
of a Magnetic Substance 


The problem of magnetic field distortion has practical significance 
only when the distortion is due to an iron body. To a large extent, 


Fig. 119 


We shall be repeating the analysis given on P. 259 for the analogous 
Case of dielectric bodies. : : i 
At the boundary of two media of different magnetic permeability, 
the magnetic field vectors (induction as well as intensity) are refract- 
ed. To determine the law of refraction, let us first consider the 
Magnetomotive force along a small circuit ABCD whose sides paral- 
el to the boundary surface are close to each other, but on either 
Side of the boundary as shown in Fig. 419. Since no current flows 
through this circuit, the magnetomotive force is equal to zero. 
et us resolve the magnetic field intensity vectors on either side 
of the boundary into normal and tangential components. From the 
igure it is evident that the magnetomotive force can be equal to 
zero only if the tangential components are equal to each other: 
Hy = Hot- 
19* 
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Another condition at the boundary between two media is estab- 
lished by considering the magnetic flux passing through a small cylin- 
der (not shown in the figure) enclosing a portion of the boundary. 
Since the magnetic lines have no sources, the number of flux lines 
entering the cylinder through the top is equal to the number leaving 
through the base. The lateral surface has an infinitely small area and 
the flux through it is equal to zero. Now, let us resolve the magnetic 
induction vectors on either side of the boundary into normal and tan- 
gential components. Clearly, the flux entering the cylinder can be 
equal to the flux leaving the cylinder only if the normal compo- 
nents of the induction 
vectors do not change in 
crossing the boundary: 
Bi Bon. 

From these two rules, 
we can determine the 
law of refraction for flux 
lines. It is evident from 
the figure that 


tangy _ MU 
tango umg ` 


In passing from air into 
iron, the magnetic flux 
lines are deflected from the normal to a considerable extent and as 
a result the flux density sharply increases. For this reason, an iron 
body, whose permeability is hundreds or thousands of times greater 
than uo “absorbs”, flux lines. This is the basis of magnetic shield- 
ing. Magnetic flux cannot, in effect, penetrate into a region 
bounded by iron since practically all of the magnetic lines enter 
the-iron (Fig. 120). 
The distortion of 
specified shape is d 
the case of a dielectr 


a magnetic field due to a magnetic body of 
etermined in exactly the same manner as iD 
; ic. If the iron body has the form of an ellipsoid, 
cylinder or plate, theoretical calculations show that the field inside 
the body will be uniform if the field was uniform before the iron 
was introduced. A relationship completely analogous to that given 
in Sec. 96 exists between Hy, the external uniform field before the - 
body is introduced, and H;, the field inside the iron after the body 
is introduced. The field intensity inside the iron body becomes less 
Poe original intensity by an amount proportional to the magnet- 
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In order for the demagnetisation factor to be dimensionless, it is 
necessary to divide the magnetisation by the magnetic permeability 
of vacuum. Continuing with the practical system of units and sub- 
stituting 


J =p (u—1) Hi, 


we obtain the following relationship between the external and inter- 
nal fields: 


H=: 
In the Gaussian system of units 
BeNi: 
PEN 


and the relationship between the external and internal fields is 
given by 


Hie ‘ 


N 
I+) z 


The demagnetisation coefficient has the same value as in the case 
of dielectrics: 


n= (v =) 


for a.sphere, N = 4x (N’ = 1) for a plate, etc. 
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y of iron, we may have created the 


In discussing the permeabilit 
i k p agnetic sub- 


alse impression that the magnetic properties of ferrom 
stances differ from those of paramagnetic substances only as regards 
the magnitude of the permeability. This is by no means the case. 
Ferromagnetic bodies differ from the others mainly in that the mag- 
netic state of such a body is not linearly dependent, and moreover 
1S not uniquely dependent, on the magnetic field intensity. Therefore, 
the concept of permeability for ferromagnetic substances is very rel- 
ative, The magnetic properties of iron are best illustrated by a mag- 
netisation vs. field intensity curve or a magnetic induction vs. ficld 
tensity curve. These curves are very similar to each other. 

Let us consider the magnetisation of an iron body as a function’ of 
the field intensity. At first, the magnetisation increases slowly, then 
tapidly, and finally magnetic saturation sets in. Such magnetisation 
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curves were first used by A. G. Stoletov and are characteristic of all 
ferromagnetic bodies (Fig. 121). We reiterate: the magnetisation 
and magnetic induction curves are very similar. The slope of the 
magnetisation curve gives the magnetic susceptibility, while the 
slope of the induction curve gives the magnetic permeability. From 
the figure, it is seen that the per- 
meability (also susceptibility) 
15000 curve has a maximum. For weak 
i fields the permeability is low. As 
E il -the field intensity is increased, u 
10000 increases to a maximum, then be- 
Fi gins to drop, and after reaching 

saturation remains unchanged. 


5000 i When the value of the permeabil- 
=| | ity is given without specifying 
the external conditions, the maxi- 

ie mum permeability 


is usually 
O 20 40 60 80 100 120140 H,ajem meant. 


Fe gaussicn/a But there is more to be said 
about the behaviour of ferro- 
magnetic substances. Let us assume 
that after the iron has been 
brought to a state of magnetic satu- 
ration the magnetic field intensity 
is decreased. It turns out that 


B, gauss = 


20 40 60 80 100 120 140 H,ajem 


Fig. 124 higher than the initial magnet- 


the residual magnetisation, the pol 
the experiment discussed on p. 285, it would 


5 i l E direction, the body begins to become 
magnetised in the reverse direction, i.e., what was previously a south 
The magnetic flux increases until 
ation is the same as in the initial 


- Having attained a negative induction 
proceed with the process in the other dire 


maximum, we may then 
hysteresis loop shown in Fig. 122 is obtai 


ction. In this manner, the 
ned. 
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It is seen from the figure that knowledge of the intensity of the 
field in which the iron is located isnot sufficient to determine the mag- 
netic induction and, hence, the magnetic permeability. Thus, for 
example, it is seen that three values of induction are possible for 
H = 400 oersteds—one occurs during initial magnetisation, the 
second during demagnetisation and the third when the magnetisa- 
tion process is repeated, i.e., just before the hysteresis loop is closed. 
The value of the magnetic induction, and the magnetic permeabil- 
ity, depends’ on the previous “history” of the sample. Hence, the 
designation “hysteresis loop”. 

A hysteresis loop is usually drawn on the assumption that the fer- 
romagnetic body is brought to magnetic saturation. However, we can 


B; gauss 


Hoersted 


Fig. 122 


clearly obtain numerous hysteresis loops having smaller dimensions 
y inscribing them, as it were, in the fundamental loop. For this 
Purpose, it is necessary to begin the demagnetisation process before 
reaching saturation. Then, to each H value there corresponds an 
infinitely large number of values. , 
A procedure based on this fact is used to bring a ferromagnetic 
ody to a state in which both induction and field intensity are equal 
to zero. This “zero point” is achieved by a series of successive magnet- 
ie reversals, whereby each succeeding cycle is begun at a lower 
intensity level than the previous one. : 
he magnetic state of iron cannot be characterised only by the 
Value of the permeability, field intensity or induction. Two of these 
quantities must be known—e-g-, the induction and the intensity. 
he magnetic state of the iron is then represented by a point inside 
the fundamental hysteresis loop- 
he nature of a hysteresis loop depends to a great extent on the 
Material. A body is said to be magnetically soft if the coercive force 
and hence the loop area) is small. Typical soft materials are pure 


as, ; 
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iron, silicon steel, and iron and nickel allo 
loy—78% nickel). Carbon and other steels b 
hard materials and are used in the manufa 
nets. 

Experiments show that the temperature of a ferromagnetic sub- 
stance rises when it is subjected to magnetic reversals. This is very 
important in electrical engineering, for when iron is placed in a var- 
iable magnetic field the point on the B — f (H) curve representing 
the magnetic state of the iron is continuously tracing a hysteresis 
loop. Every time a loop is traced a certain amount of heat is released, 
which according to magnetic field theory is related to the loop area; 
of course, the lower the value of the induction maximum, the smaller 
the loop area. Therefore, empirical formulas may be sought relating 
heat released and maximum induction. In electrical engineering. 
for example, the following formula is widely used: 


1.6 
Q > NBriax, 
where 7 is a coefficient whose value is given in tables, 


ys (particularly permal- 
elong to the magnetically 
cture of permanent mag- 


Example. For a good transf 
= 10,000 gausses, the losses are e 
= 2.5 X 1074 joule/em3. 


to an alternating current 
to 12.5 x 1073 w: 


ormer steel, ņ = 0.0011. When Umar = 
are equal to Q = nB{;$, = 2.5 X 10? ergs/cm? = 
This means that for magnetic reversals in iron due 


whose frequency v is 50 cps the power loss is equal 
att per cubic centimetre of iron, 


CHAPTER XVI 


ELECTROMAGNETIC FIELDS. MAXWELL’S EQUATIONS 


441. Generalisation of the Law of Electromagnetic Induction 


i In the preceding chapter it was shown that the motion of a conduc- 

If ina magnetic field is accompanied by induction phenomena. 
this moving conductor is part of a circuit through which the mag- 

netic flux changes when the conductor moves, a current correspond- 

i s 1 d® e s 3 A 

ing to the induced emf = >| flows in the circuit. This current 

: ; 1 

is due to the action of a Lorentz force: a force equal to = [vB] acts on 


a unit electric charge. 

The induced current depen 
of the conductor with respect to t 
asserted with equal validity that 
a charge moves in a magnetic field o 
and the magnetic field moves. This 
relativity. 

f Consider a system of coordinates relative to which a magnetic 
Held moves. Such a coordinate system may be fixed, for example, 
relative to a laboratory bench along which the pole of a permanent 
Magnet moves. Then, a Lorentz. force will act on charges at rest 
telative to this bench. Let us assume nothing is known about the 
Moving permanent magnet. Having established that a force acts on 


the stationary electric charges, we are perfectly justified in conclud- 
s system whose intensity 18, 


ing that an electric field exists in thi ( 
equal to the Lorentz force divided by the magnitude of the charge. 
hus the electric field intensity in the “stationary” coordinate system, 


relative to which the source of constant magnetic field moves with 
Velocity v, is expressed by the formula 
4 
E=+ WB}. 


ds only on the relative displacement 
he magnetic field. Thus, it may be 
a Lorentz force is produced when: 
r when the charge is “at rest” 
follows from the principle of 


Of course, the relations differ for the electric field created by 
Charges and the electric field created by the motion of the system rela- 
tive to the magnetic field. To begin with, this new field that we are 
Considering has no charge sources. This means that the flux lines 
nave neither beginning nor end. Moreover, it is not difficult to see 
that the flux lines of this electric field form closed curves, i.e., the 
electric field created by the moving magnetic field is rotational. 
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Imagine an arbitrary circuit that is stationary relative to the lab- 
oratory bench. The moving magnetic field crosses this circuit. I 
this imaginary circuit is replaced by a real wire circuit, then in 
accordance with Faraday’s law an emf is induced in the circuit that 


is equal to § ar. Hence, the integral U = $ Eat is not egual to 


zero. This means that the electric field B = 2 [vB] created by the 
moving magnetic field is a rotational field. 

For a real wire circuit U = LE ; where @ is the magnetic flux 
passing through the circuit. However, it is immaterial whether or 


not wire is present at the location of the closed curve. The equation 
1d 


= 3 is also valid for an imaginary circuit in the region where 
the sources of the magnetic field are moving. 

One final generalisation remains to be made. Experiments show 
that the cause for the change in the magnetic field is of no importance 
in the induction effect. The field change due to the motion of a per- 
manent magnet and that due to a change of current strength in a sta- 
tionary coil can always be made equal by, for example, bringing the 
permanent magnet closer or increasing the current in the coil creat- 
ing the field. Therefore, the law under consideration must be valid 
in all cases, no matter how the magnetic field is changed. Thus, if 
the magnetic field (magnetic flux) changes in a certain region in 
Space, a rotational electric field is produced that is related to the 
magnetic field change as indicated by the following law. The elec- 


tromotive force J = § Ea along a closed curve is equal to the 


derivative with respect to time of the magnetic flux passing through 
this circuit: 


4 do 
è dt 


or, in the practical system of units, 


This is the generalised law of induction, one of the most important 
laws of nature. 


Let us examine the mathematical expression for this law. Equating 
the expressions for the electrom 


r otive force and the magnetic flux, 
the law can be written as follows in the Gaussian system of units: 


f Edt = m f B cosg ds, 
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and in the practical system of units 


$ Edl = =£ f B cosg dS 

First, as regards the minus sign that appears in’ this equation, it 
should be noted that in vector analysis the direction in which the 
Circuit is traversed and the direction of 
the normal to the plane of the circuit are 
related to each other as follows: the posi- 
live direction of the normal in a right- 
handed screw system is such that as viewed a 
from the vector terminus the circuit ap- 
Pans to be traversed in the counterclock- 
ase) direction (Fig. 123). Let us construct 
a closed curve in space and ascribe an 
arbitrary direction to it. The direction of 

1e normal to the area encompassed by 
TA curve under consideration is thus 
ne rmined. Magnetic flux passes through 

lis Circuit. At a given instant, it may Fig. 123 
So ative or negative, depending on 5 
he her the induction vector forms an acute or obtuse angle with 

normal. The derivative of the flux with respect to time is positive 


n 


Positive 
direction 


Positive 
direction 


@® decreases @ increases 


Fig. 124 
x the flux is increasing and is negative if the flux is decreasing. Thus, 
aking the minus sign into account in the induction formula, the 
ie may be stated as follows: The electromotive force is positive 
Positive flux is decreasing or negative flux is increasing, i.e., the 
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direction of the electric lines of force coincides with the adopted 
direction for positive. On the other hand, the electromotive force 
is negative if positive flux is increasing or negative flux is decreasing. 
These relationships are clearly illustrated in Fig. 124. 

We shall show that the minus sign in the induction formula is 
the mathematical expression of Lenz's law. Assume, for example, 
that the north pole of a bar 
magnet approaches a coil and 
= that the positive direction in 
| z \ Positive the circuit is as indicated in 
|. direction Fig, 125. Then, the magnetic 
flux is positive and so is its 
derivative with respect to 
time. The electromotive force 
must be negative and the 
induced current is opposite to 
the direction adopted as positive. We can immediately find the 
magnetic field of the induced current by recalling that the flux lines 
emerge from the side of the current ring from which the current 
appears to be moving in a counterclockwise direction. Therefore, as 
the magnet approaches the circuit, a current of such direction is 
induced in the latter that the field produced tends to oppose the 
action which caused it. This is a statement of Lenz’s law. It is not 
inet to verify this important rule for other particular cases as 
well. 

Let us sum up. A varying magnetic field is inseparable from an 
electric field. Moreover, it is seen that the division of fields into 
electric and magnetic is a relative matter. From one viewpoint there 
is only a magnetic field in space. From another viewpoint in addi- 
tion to the magnetic field there is an electric field. 

_A rotational electric field consists of electric lines of force that 
link the magnetic induction vectors when the magnetic flux passing 
through a closed line of force changes with time. When the flux 


increases, the direction of the line of force is clockwise as viewed 
from the induction vector terminus. 


Fig. 125 


112. Displacement Current 


Electromagnetic field theory, whose found 


day, was mathematically perfected Hythe Bach laid: 
| glish scientist James 
Clerk Maxwell. One of Maxwell’s most aA mew ideas WaS 


that symmetry must exist in the interdependence between magnet- 
ic and electric fields. 


ation was laid by Fara- 


In the preceding article, we discussed the problem of creating an 
electric field by varying magnetic flux. The question naturally 
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arises: Does a variable flux of electric lines of force create its own 
magnetic field? Maxwell answered this question in the affirmative 
and advanced the hypothesis that a relationship exists between 
variable electric flux and a magnetic field that is quite analogous to 
the generalised law of induction. According to the hypothesis, if 
a change in electric flux occurs in some region of space, a rotational 
magnetic field is created. Moreover, the magnetomotive force U 
taken along a closed curve is equal to the change in electric flux 
passing through this closed curve, i.e., 
aN 

U= 3 

where 


U=f Hdl 
and the electric flux 


N= $ D cosg dS. 
S 
In the Gaussian system of units 
1 dN 
U Gi 
_ The parallel in the relationships between magnetic and electric 
fields does not extend to the sign before the derivative of the flux. 
As is known, when currents are present the magnetomotive force 


4n 7. . 
along a closed curve is equal to U = I (or = I in the Gaussian system 


of units). How should the equation for magnetomotive force be writ- 
ten for such a closed curve that encloses electric current and variable 
flux of electric lines of force? Maxwell assumed that the magnetomo- 
tive forces are additive. Thus, the general formula has the form 


a aN 
$ Hal =I +- 


or in the Gaussian system 


§ mat => [ ar) 0 


The expression Ny asthe dimensionality of electric current strength. 
dt 


Maxwell called it displacement current. He thereby incorporated 
in this designation the very widespread notion at the end of the 
nineteenth century that the field in a vacuum displaces the particles 
of an “ether” from their positions of equilibrium. ‘This designation 
has continued to prevail in science, although now the presence 
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of a field in vaccum is not related to the concept of particle displace- 
ment of any medium whatsoever. In a dielectric medium, the 


displacement current a may be resolved into two components, cor- 


responding to the intensity and polarisation vectors into which the 
displacement vector D can be resolved (see p. 255). Therefore, the 

portion of the displacement current “flow- 
(aN ing” in the dielectric is determined | by 
\ 1 the change in the polarisation vector, i.e., 
| by the relative displacements of the centres 
| I of gravity of the positive and negative 
Nae charges. 

Before discussing the role of displacement 
current in one or another process, we shall 
prove an important proposition concerning 
the sum of the conduction and displacement 
currents. 

Consider an arbitrary system of, electric 
currents and imagine a closed surface 
drawn in such a manner that the currents intersect it. If the cur- 
rents are constant it follows directly from the law of conservation 
of electricity that the sum of the currents entering the closed sur- 
face must be equal to the sum of the emerging currents, or, to be 
more concise, the algebraic sum of the currents flowing through a 
closed surface is equal to zero. It is evident that this law may not 
be obeyed by variable currents: for example, in the case of a closed 
surface enveloping one plate of a condenser connected in an alter- 


nating current circuit (Fig. 126) or a closed surface through which 
the top of an antenna protrudes at one point. 


However, this theorem is valid for variable currents 
the term “current” is taken to mean “total current”, i.e., the conduc- 
tion current plus the displacement current, rather than the conduc- 
tion current alone. To prove this, it suffices to consider an arbitrary 


curve on which a surface is based and for which the following relation 
is valid: 


Fig. 126 


as well if 


§ Hdl =I Haien. 


By gradually reducing the closed curve to zero, the surface S on 
which this circuit is based becomes closed as shown in Fig. 127. 
(This process is similar to constricting the opening of a draw-string 
pouch.) The magnetomotive force reduces to zero and hence the sum 


of the conduction and displacement currents passing through the 
closed surface also equals zero. 


Now let us discuss the role of 


" displacement currents in electro- 
magnetic phenomena. 
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It can be shown that displacement currents are negligibly small 
where the conduction currents are different from zero. Hence, the 
displacement current within a conductor is always disregarded. 

In calculating the value of the displacement current in dielectrics, 
two cases should be considered—displacement currents in a dielec- 
tric surrounded by a conductor forming a closed circuit and displace- 
ment currents that are a continuation of a conductor of an open 
circuit. 

Consider a conductor forming a closed circuit in which electric 
current is flowing and which is intersected by a closed surface. If 


H 


the current is constant, then at each instant the same amount of 
electricity passes outward through the surface as inward. The situa- 
tion is different in the case of variable currents. Here, the strength 
of the variable current may have different values in different parts of 
the circuit (see below, p. 322). As a result, the strengths of the cur- 
rents passing inward and outward through the surface at a given 
instant may not be equal. Then, a displacement current flows 
via the dielectric from the point where the current is less to the 
Point where the current is greater, and in this manner the current 
eficit is compensated for. Clearly, changes in the displacement 
Current with respect to time will exactly correspond to changes in 
he conduction current. This phenomenon is significant only when 
the frequency of the current is sufficiently high. igh x 
the conduction current does not form a closed circuit, e.g., in 
he case of an alternating current circuit containing a condenser, 
1e conduction and displacement currents are simply equal to each 
other. In this case, it may be said that the conduction current cir- 
Cuit is ¢ isplacement current. ‘ 

In oe i we ate the magnitudes of the displacement cur- 
rents are quite large for such cases, certain calculations may be 
Performed without considering them. Thus, when the conduction cur- 
Tent circuit is closed by the displacement current between the con- 
€nser plates, the magnetic field created in this region by the displace- 
Ment current is the same as the field that would have been produced 
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if the conduction current*flowed in an uninterrupted circuit. There- 
fore, the presence of displacement current does not affect the calcu- 
lation of the magnetic field, the coefficient of self-induction ofa 
system, etc. 


143. Nature of an Electromagnetic Field 


The following equations, which were discussed in the two preced- 
ing articles, are called Mazwell’s equations: 


§pat=— and $ Ha =I. 
These equations concisely sum up our knowledge of electromagnetic 
fields. f 

Maxwell’s equations cannot be derived. The discussion of the pre- 
ceding two articles does not constitute a derivation, but constitutes 
rather an illustration of conjectures leading Maxwell to his discov- 
ery. 

A large class of phenomena of interest to physicists, electrical engi- 
neers and radio engineers obey Maxwell’s equations. The laws of 
these phenomena are a consequence of Maxwell’s equations and may 
be derived from them. The extraordinary importance of the predic- 
tions based on Maxwell's equations gives these equations equal rank 
with Newton’s laws of motion and the principles of thermodynamics 
as fundamental laws of nature. 

We shall not go into the mathematical methods of solving Max- 
well’s equations. It turns out that the above integral equations may 
be transformed into differential equations. Then, by solving Max- 
well’s differential equations, it is possible in principle to determine 
ea field for a given distribution of charge and cur- 
rent. 

Let us again consider the physical essence of electromagnetic 
phenomena as given by Maxwell’s equations. It may be summa- 
rised in the following manner. 

The division of an electromagnetic field into electric and magnetic 
fields has only relative meaning. If from the viewpoint of an inertial 
system of coordinates only a magnetic field exists, then from the 
viewpoint of another system moving relative to this system there 
exists an electric field in addition to a magnetic field. The converse 
is also true, namely, if an observer in one system of coordinates 


finds only an electric field present, then an observer in another 
inertial system will find that both an electric field and magnetic 
field exist. 


Let us now consider an electromagnetic field from the viewpoint 
of an inertial frame of reference, first directing our attention to â 
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region of space in which free electric charges and hence conduction 
currents are absent. In this case, Maxwell’s epuations have the form 
: aN a d® 
$ Hdl =~ and ẸEdl= — r > 
Both the magnetic and electric fields have a purely rotational charac- 
ter, i.e., the lines of force are closed and mutually interwoven: 
electric lines encircle magnetic lines and magnetic lines encircle 


ue =S=S' 


Fig. 128 


electric lines. The electromagnetic field may be depicted as a chain 
of rings, whereby closed magnetic lines of force are alternately linked 
with closed electric lines of force (Fig. 128). Such a chain exists only 
if the field is variable: A ring of increasing magnetic flux creates 
about itself a ring of electric flux; the varying electric field creates 
a ring of magnetic flux, etc. à 

the reet space under consideration contains charges ae 
Currents, then in addition to rotational fields with linked a 9 
force there exists a rotational magnetic field whose closed MoE a 
encircle currents and a potential electric field whose flux lines begin 
On positive charges and terminate on negative charges. 


~ 


20—4409 


CHAPTER XVII 


ENERGY TRANSFORMATIONS 
IN ELECTROMAGNETIC FIELDS 


114. Transformations in Steady Current Circuits 


Let us consider a portion of a conductor through which a steady 
electric current is flowing. If the resistance of the conductor segment 
is R and the potential difference across it is U, the current strength 
is determined by Ohm’s law, namely, J ang. The electric field 
performs work in moving charges along the circuit and the work 
per unit charge is equal to U. Since the current strength is, by defi- 
nition, the quantity of electric charge flowing through the conductor 
cross-section per unit time, the product JU yields the work per- 
formed by the field in moving the electric charge per unit time. This 
product, JU, represents power. If the current is a steady one, this 
work is completely converted into heat (thermal energy). Thus, 
the formula for calculating the thermal effect of current is 


. U2 
IU =a SPR. 


The transformation of the work performed by the electric field 
into heat occurs at each point of the conductor. To express this mathe- 
matically, Ohm’s law must be converted into a form applicable to 
a point of a conductor, rather than to a portion of a conductor. By 


introducing the current density j, which is equal to a where 5 


is the cross-section of the conductor, and by repl 
sion for the potential difference by Æl 
the resist 


acing the expres- 


$ , and, finally, by expressing 
ance in terms of the conductor length J and its cross-section, 


Teh ye — we obtain: j = AF. 


Thus, it may be said that the current density is directly propor- 
tional to the electric field intensity. The specific conductivity A 
is the coefficient of proportionality and the direction of the current 
is assumed to coincide at each point with the direction of the inten- 


sity. The formula 
J=)hE 


is called the differential form of Ohm’s law and should be viewed as 
an empirical law generalising the laws for current flow in conduc- 
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tors. Ohm's law in its usual (integral) form is a consequence of this 
equation. 

Consider an infinitely small volume element of the conductor, 
dz, in the form of a cylinder whose generatrix dl is parallel to the 
flux lines and whose base dS is perpendicular to the current. The 
amount of electric charge flowing through a cross-section of the 
cylinder is j@S and the potential difference between the ends of the 
element is dl. Hence, the work performed by the field in moving 
the electric charge through this volume is equal to j£dv. This for- 
mula also gives the heat released inside the volume dt. If we are 
interested in the work of the current in a large volume of the con- 
ductor, the last expression must be integrated. The formula 


=)E? 


gives us the expression for the work of the current or the thermal 
energy released per unit volume of the conductor. 

Thus, in the case of a portion of a direct-current circuit, the energy 
transformations are reduced to the transformation into heat of the 
work done by a field. However, the picture changes as regards the 
energy balance of the entire closed direct-current circuit. The work 
performed by the electric forces along a closed curve when the field 
is constant is equal to zero, for the work performed by the electric 
orces in moving charge along the external portion of the circuit is 
equal and opposite to the work required to moye the charge along 
the internal portion of the circuit. Therefore, the release of thermal 
energy in a direct-current circuit occurs only at the expense of the 
energy supplied by the current source—accumulator, electric gener- 
ator, etc.—i.e., at the expense of energy of nonelectrical origin or, 
as it is sometimes said, at the expense of the energy of an “applied” 
force. The role of electric current is reduced simply to the “transfer” 
of energy from the current source to the point where the heat is 
released. The energy that a source is able to supply is given by the 
electromotive force @, which by definition is measured by the work 
Performed in moving a unit charge along a closed curve. Actually, 
the applied emf performs this work only over those small portions 
of the circuit where the charge must be moved against the forces of 


an electric field. eae te score 
he power of the direct-current circuit is given by the expression 
Ig of unit volume if it is assumed 


. This may be expressed in terms 
that the applied foe are distributed throughout the volume. Then, 


the work performed by the applied forces is given by 


jeer, 
20* 
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where Ẹerrl is the “intensity” of the applied forces. 

Designating the work of the applied forces by P, and the thermal 
energy released by Q, the essence of the electrical transformations 
in a direct-current circuit may be expressed by the concise formula 


P~Q=0. 


Example. For an isolated copper wire of cross-section S = 4 mm?, the per- 
missible current density in the case of an open wire is j = 9 X 10° amps/metre®. 
A 1-metre length of such a wire has a resistance of 4.25 x 10-8 ohm. For the 
indicated value of j, a current~of J = 36 a flows in the wire and the thermal 
energy losses per second for this portion of the circuit amount to I?R = 


= 1,296 X 4.25 x 107? = 6 joules, i.c., in a unit volume 0.33 cal are re- 
leased each second. 


115. Transformations in a Closed Circuit of Variable Current 


A flow of variable current is inevitably accompanied by induction 
effects. Thus, to a variable current strength there corresponds a vari- 
able magnetic flux ®. Here, © represents the number of lines of 
force which are created by the current circuit and which pass through 
the conducting circuit. In this case, the induction effects are due to 
the current’s own magnetic flux, whence the designation -self-induc- 
tion. Since @ is continuously changing, an induced emf a 


— -r exists in the current circuit at each instant, in addition 
to the applied emf. 

The magnetic flux is always proportional to the current to the 
first power. Hence, the formula Ð = LI has universal validity. The 
coefficient L is called the inductance of the circuit or the coefficient 
of self-induction. The value of L depends on the geometric properties 
of the circuit and the nature and distribution of magnetic bodies in 
the system. It does not depend on the conditions under which the 


system of conductors and magnetic bodies operate. Thus, for self- 
induced emf the following equation holds: 


pind __ a 
Ca ar 


The significance of the minus sign in this formula may be explained 
as follows: When the current increases, the induced emf opposes the 
applied force, i.e., the induction is in the direction opposite to the 
applied force. On the other hand, when the current decreases, the 
directions of the induced emf and the applied emf coincide. This 
is the reason for the analogy generally made between mechanical 


inertia and self-induction. Self-induction impedes an increase as 
well as a decrease in current.. 


115. Transformations in a Closed Circuit of Variable Current 309 


Ohm’s law, relating emf and current strength, remains valid. 
Therefore, the product of current strength and total circuit resistance 
will at each instant be given by the following relation: 

Pa i dl 
TR= cappl gind — pappl = y ee 
o +6 é rea 
Multiplying both members of the equation by the instantaneous 
current strength, we obtain the energy equation: 


PR=160™"— LI. 


Here, grr! — P is the work of the applied forces and J'R = Q 
a the thermal energy. It is seen that in a variable current circuit 
hese two quantities are not equal to each other. The difference 


P— Q is equal at each instant to te i.e., it is equal to the deri- 


vative of 4 LI?. In other words, the excess of the work of the applied 
forces over the thermal energy released goes to increase the magni- 
tude of TTE On the other hand, the excess of the heat released over 
the work of the applied forces occurs at the expense of the magnitude 


1 3 
of 5 LI?. The equation 


poni 
is the expression for the law of conservation of energy. 
Clearly, the quantity W = SLE represents energy. It is the mag- 
netic energy of a system that is inseparably linked with the existence 
of a magnetic field in itv (In the Gaussian system of units the expres- 


Sion for magnetic energy is ŻLE) There is magnetic energy in a 
. ce á I . 
Girect-current circuit too, but in this case it does not manifest itself 
ee it remains unchanged. The induction effects occur only when 
49 current is switched on and off. When the circuit is closed the 
aPplied forces perform work, which is expended not only in the 


release of heat, but also in the storage of magnetic energy. On the 


Other hand, when the circuit is opened the thermal energy released 


R „At the expense of the magnetic energy of the current. 

č he magnetic energy formula may be verified experimentally by 
osing or, even better, opening a current circuit. The thermal energy 

Pleased after the source is disconnected is numerically equal to 

vhe magnetic energy of the current. If the coefficient of self-induction 

's large, the release of heat continues over A period of time sufficient- 
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ly long to enable us to measure the heat by, for example, calorimet- 
ric means. 


Inductance may be measured in various ways and in the simplest 
D z 
cases may be calculated from the formula L = FT The problem is 


reduced to the calculation of the magnetic flux passing through the 
system. h 
An expression for the inductance of a ring solenoid will be required 
below. The magnetic flux ‘through one turn of a coil is ® = 
= Wow S, where § is the area of the turn, and the flux through n 
turns is Ð = nuou HS. Substituting the expression for the field 
intensity (using the practical system of units), we obtain 


I 
D =npousS a K 


Now, dividing both members of this equation by the current strength, 


we obtain an expression for the inductance of a coil (also approxi- 
mately valid for a straight solenoid): 


n2 


L= pup = S. } 


The inductance of a coil is directly proportion 
permeability of the medium and increases sh 
of turns. An increase in inductance is achieved by using iron or by 
increasing the number of turns. In order to make clear the relation- 


ship existing between the coefficient of self-induction and the dimen- 
sions of the coil, let us multiply the numerator and the denomina- 
tor by l. Then, 


r-n (8)'7 


and it is evident that the induct 
volume occupied by the m 
squared. 


al to the magnetic 
arply with the number 


ance is directly proportional to the 
agnetic field and to the turn “density” 


Example, Consider a lon 
= 1,500 turns, S = 1 cm? 
magnetic flux is D = 
solenoid is 


g solenoid of small cross-section 
and J= 0.1 a 


nyo S = 6x x 4 


(l= 15 cm, n= 
). At the centre of the solenoid, the 
0? volt sec. The inductance of this 


n? à, 1,500)2 
L= pop- S=4n x 10-7 x 4 x u a X 10-4= 1.9 x 10-3 henry. 


In practical units the inductance is measured in henrys (1 henry = 1 ohm 


sec). The inductance of coils in radio engineering is measured in millionths 
and thousandths of a henry. Chokes hay 


ing iron cores can attain inductance 
values of the order of a number of henrys. 
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116. Magnetic Energy of a Field 


In the chapter devoted to electric fields, it was shown that the 
electric energy of a system may be viewed as a quantity whose den- 
sity distribution is represented by + sE? (in the practical system of 
units). The electric energy of the system is then determined by inte- 
grating this expression over the region occupied by the field. The 
importance of this circumstance was emphasised as it enables us to 
express the energy in terms of the field intensity and it confirms our 
conception of a field as something that can be localised. 

_ Naturally, we expect the situation to be similar for magnetic 
fields and this is indeed the case. It can be mathematically shown 


A 1 
that the transition from the magnetic energy formula, Fh to 


the expression for the magnetic energy density, + poH, is complete- 


ly analogous to the corresponding transition from electric fields. 
Let us consider this transition for the simple case of the uniform 
field of a ring solenoid. Substituting the expression for the inductance 


in the magnetic energy formula, we obtain 


vt (4) 
Wu= 7 Ve 


But a is the field intensity. Hence, the magnetic energy of a coil 
may be written in the form 


uH? 
Wu= tate Y, 


So that the magnetic energy density is given by the expression 


1H? 
wM= mane t 


In the Gaussian system of units 
4 9 
UM =a uH. 


f currents, the magnetic energy may be 


Thus, for any system o d by the field: 


represented by the integral over the volume occupie' 
Wy = wi? dt (in the practical system of units) 
2 
and 


Wu=e f uH? dt (in the Gaussian system of units). 
JT 
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Now, consider the magnetic energy of two currents. The expres- 
sion for this energy naturally divides into three integrals if the 
intensity of the resultant field H is viewed as the sum of the field 
intensities of the two currents: H = H; + H». In the following 
expression for the magnetic energy, the significance of each of the 
integrals is quite evident: 


w= to f uH? dt po f uH, H, dr +10 f uH? dr. 


The first and third integrals yield the magnetic energy of the first 
and second currents, respectively, while the second integral repre- 
sents the interaction energy of the two currents. This last integral 
may assume different values even though the magnitudes of the 
field intensities, H, and H, do not change. Thus, if the mutual 
disposition of the two-currents changes, the field vectors H, and Ho 
are turned, generally speaking, relative to each other and the value 
of the interaction energy also changes. 

Of course, the first and third integrals may be expressed in terms 
of the current strength and the inductance: DLE and DET. As 


for the second integral, it is clear that its value is proportional to 
the product of the current strengths. Thus, hd 


WH Hy dt = MII. 


The coefficient of proportionality M is known 
mutual induction. Just as in the case of inductance, M depends on 
the geometry of the system and the distribution of magnetic bodies. 

Thus, it is evident that the change in the magnetic energy of a 
system of currents is related not only to the work of the applied 
forces and the thermal energy released, but also to the work per- 
formed by the field in moving the conductors under the action of an 


Ampère force. Hence, the law of conservation of energy requires 
that the following equation be satisfied: 


as the coefficient of 


dW = —A—(Q—P) at, 


where A is the mechanical work. Thus, it may be stated that, in the 
general case, the magnetic energy expended is equal to the work of 
moving the conductors and to the excess of released thermal energy 
over the work of the applied forces, 


The relations introduced in this article do not take into account 
one phenomenon—magnetic hysteresis. This problem will not be 
dealt with because of its specialised nature, 
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Example. The energy stored in the magnetic field of the coil described in 
the example on p. 310 is 


© LI 1.9 10-8 (0.12 À 
Wy==-- 2x 0.1)" — 0.95 x 10-5 joule. 


The energy density is 


2 “(1,000)2 } 
Wyr = Holt fore 4n xX 10-7 X 1X 1,000)" =0.63 joule/metre’, 


Naturally, the same result could be obtained by dividing the total energy of 
the magnetic field by the volume of the coil: 


117. Electrice Oscillations 


The processes of transforming electric energy into magnetic ener- 
gy and vice versa are of fundamental importance in electrodynamics. 
A simple system in which such transformations occur Is a charged 
electric condenser whose plates are 
connected at a certain instant to the 
ends of a coil (Fig. 129). When the t 
condenser discharges, an electric cur- 
rent flows through the coil and creates 
a magnetic field around it. At each c 
instant, the electric field of the condens- 5 
er and the magnetic field of the coil 
are closely linked. The energy of this 


System at each instant is equal to the , 
energy of the electric field, which is concentrated mainly between 


‘he condenser plates, and the energy of the magnetic field, which 
is concentrated mainly inside the coil. As is well known, lee 
oscillations arise in such a circuit and we shall now show that such 
Oscillations are inevitable. 


l To begin with, let us disregard thermal energy losses. Then, the 


aw of conservation of energy requires that the following equation 


e satisfied: 


Fig. 129 


1 @ , 1 772—const. 
PAEA may 
and the magnetic energy is the same 


The 
Sum of ric energ x p 
ee ay e of the above expression with 


ie each instant. Hence, the derivativ 
spect to time is equal to zero: 
aw _ Q d TaN). 
a edi ti 
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Since the current strength is equal to the decrease in charge on the 
condenser plates, i.e., 


_ 40 
Seo 
the equation may be simplified as follows: 
Q CLL oe 
o La~ 0. 


Such a relationship between the charge on the plates of a condenser 
and the current strength, which is equal to the derivative of the 
charge with respect to time, can be satisfied only if harmonic oscilla- 
tion of the charge and the current is assumed. 


This becomes evident upon comparing the above relations with 
the equations for mechanical vibrations (see p. 87): 


dQ dz | 
AmA TAr 
dI 1 dv 
ie re Oy m—a-= — ke. 


Charge and current, on the one hand, are analogous to displacement 
from equilibrium and velocity of motion, on the other. As for the 
parameters of the system—inductance is analogous to mass and 
reciprocal capacitance is analogous to the rigidity of the system. 


Let the initial time equal the instant when the condenser is fully 
charged, and assume that 


Q = Qi cos wl. 
Then, 
ISE Qo sin ot, 
Substituting in the differential equation, we obtain 
— LQow? cos wt = s5 Qo cos œt 
or, after cancelling, 
oe 
Vre 


‘Thus, irrespective of the initial ch 
harmonic oscillations occurring in 


quency @) = 


arge on the condenser plates, the 
the condenser have a natural fre- 


The smaller the capacitance and inductance 


of the circuit, the higher the frequency of the electric oscillations- 
What is the situation in a real circuit where the thermal energy 
losses cannot be neglected? Clearly, the total energy of the system 
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in this case will decrease in accordance with the equation 
iW SrA spp oe Ei 
d PRat, i. e, —PR a Qa SDi T a 


ye rentiating aea with respect to time and using the relationship 
etween charge and current, we obtain an equation of the form 


d?I dI 1 
LL R+.. 


At this point, an analogy should be drawn between the corresponding 
electric ‘and mechanical quantities. Comparing the last equation 
with the equation for mechanical vibrations with friction (p. 94), 
it is seen that the electric resistance is analogous to the coefficient a, 
which is a measure of the mechanical resistance. 

- The solutions of such linear differential equations are considered 
in courses in advanced mathematics. We shall simply give the final 
result, which incidentally is easily verified by substitution in the 
above equation: 

I= Iye—®! cos wt. 


The frequency of oscillation is given by 
o= Va. 

Thus, the process is determined by tw ; 
ral frequency of free undamped oscillations, 0 = Vie 


o characteristics—the natu- 
, and the 


damping coefficient, B = Z It is seen, first, that light damping is 
achieved by decreasing the resistance relative to the inductance. (To 

e sure, this is not easily done, for if we increase the number of turns 
on the coil, both quantities increase simultaneously. However, L 
does increase at a faster rate.) Secondly, it will be noted that when 

o< p? i. e, 4L<CR?, 

The discharge of the condenser 
aperiodic process analogous to 
d in a viscous medium from 


oscillations become impossible. 
under such conditions leads to an 
the return swing of a pendulum displace 
its equilibrium position. 


ave a variable condenser whose maximum capac- 


Exampl 5 at we h 
ple. Assume that w esponding inductances of the radio coils 


pance C = 500 pf. Calculate the corr 
or the 1,500 metre and 15 metre wavelengths. : 
is 1. The frequency of electric oscillations corresponding to Ay = 4,500 metres 
S vj= 2x 105 cps = 200 ke/s. Since 


then i 4.2 10-3 henry = 4.2 millihenry. 


o= 2a, = 
4n?vi 


LC’ 
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In order for the process in the circuit to be periodic, the resistance of the 
circuit must be less than 


R,=2 y=- = 3,000 ohms. 


2. Ao=15 metres, vz=2Xx107 cps=20 me/s, L2=0.12x10-6 henry =0.12 
microhenry. In order for oscillations to be possible, the resistance of the 


circuit must be less than R,=2 2- 30 ohms. 
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In a system in which the oscillatory circuit consists of a condenser 
and a coil (particularly if the condenser is composed of large plates 
separated by a short distance and the coil has a large number of 
turns), the electric and magnetic fields are concentrated in their 
respective regions. Therefore, it is possible to consider the electric 
and magnetic energies as two related, but nevertheless distinct 
quantities. This division loses physical significance to a large extent 

_ when we consider rapidly varying fields, where large electric and 
magnetic fields exist in the same regions. 

Recalling what was said in Sec. 113 about the relative nature of 
the division of an electromagnetic field into clectric and magnetic 
components, it should be understandable that it is necessary to 
introduce into the theory the concept of an electromagnetic energy 
that is formally equal to the sum of the electric and magnetic energy 
of the field. The density of electromagnetic energy in space is 

e. 


1 A 2 
w= (eE? + uH’), 


while the electromagnetic energy contained in the volume V is 
; 1 2 
W= \ (eE? + pH?) av. 


V 


In rapidly varying fields, the physical significance of the transfor- 
mation of magnetic energy into electric energy, and vice versa, is 
lost. At the same time, any energy transformations occurring in an 
electromagnetic field must be taken into account in the energy balance 
by a single electromagnetic energy quantity. 

If the above expression for electromagnetic energy is assumed to be 
valid, then, using the electromagnetic field equations of the preced- 
ing chapter, the following theorem for the decrease in electromagnet- 
ic energy within a certain volume of space can be rigorously proved: 

W 
— FP = (P—Q) + $K cosa ds. 
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This theorem was proved in 1884 by Poynting. (It was proved in 
a more general form, i.e., not in connection with an electromagnetic 
field, by N. A. Umov in 1874.) The integral on the right is the flux 
of the vector A.* Using calculations that we have been forced to 
omit due to their complexity, it can be shown that this vector is 
perpendicular to the plane passing 

through the field vectors Æ and H E 

(Fig. 130), and is equal to. K = 

=j} [EA] in the Gaussian system 


of units and to K = [EH] in the 

practical system of units. 

Since the values of the field inten- 

sities decrease quite rapidly with 

increasing distance from the field K 

sources, the flux of the Poynting 

vector is reduced to zero when all 

of space is taken into account. In 

this case, the theorem states: The y 

change in electromagnetic energy is 

equal to the excess of the work of the 

applied forces over the reléased heat. 
However, of most interest is the application of the theorem to a 

finite volume, i.e., when the flux of the Poynting vector is not equal 

to zero. If the volume under consideration does not include currents, 


the equation assumes the form 


dw a 
E cosa dS. 
A $K os 


Fig. 130 


nergy is equal to the flux of the 


` 
The change in electromagnetic e 
ding the volume under 


Oynting vector through the surface boun 


consideration. 
The Poynting vector characterises the flux of the electromagnetic 


energy and the last equation expresses the following fundamental 
Concept: A change in electromagnetic energy within some volume is 
accompanied by an outflow or inflow of an equivalent amount of 
energy. 

In essence, Poynting’s theorem is 
of conservation of energy and the postul 
netic energy can be localised in space- 


ee 
* It should be recalled that in m 
A dS is called the flux of vector A through th 


a direct consequence of the law 
ate stating that electromag- 


athematics an expression in the form 
e surface S. 
S 
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If Poynting’s vector really has the significance of energy flux it 
should be related to the energy density as follows: K = vw (cf. 
p. 118 where an analogous problem is considered relative lo the 
propagation of elastic waves in a medium). By means of Maxwell's 
theory, we can determine v, the propagation velocity of the electro- 
magnetic energy. It turns out that 


c 
Vee ` 

Thus, in vacuum electromagnetic energy should be propagated at a 
velocity c = 3 x 10 cm/sec, which agrees excellently with expe- 
riment. The coincidence between the values of 
c determined from purely electrodynamic expe- 
riments (e.g., the measurement of the interac- 
tion between two currents) and the value of this 
constant determined by direct measurement of 
the propagation velocity of electromagnetic 
waves is remarkable and may be taken as practi- 
cally conclusive proof of the validity of Maxwell’s 
theory. 

In a medium, the value of the propagation ve- 
locity of an electromagnetic wave is ¢ divided by 
Veu. We shall see below under what conditions 
this relationship is satisfied and it shall be 
explained why certain deviations occur. 

Let us now return to the consideration of 
energy transformations in finite regions of space 
that contain conduction currents. 

Assume that in the region under investigation there exists a cylin- 
drical conductor of radius r through which a current of density Í 
flows. The intensity of the magnetic field at the surface of the conduc- 


tor (cf. p. 279) is equal in the Gaussian system of units to H = 


and the magnetic flux lines form circles about the current axis- 
It can be seen (see Fig. 131) that Poynting’s vector is directed into 
the conductor, for the field intensity and the current vector have 


the same direction. As for the numerical value of Poynting’s vector, 
we obtain (at the surface of the conductor): 


C= 


2m oe 
— r 
eld 


gas Ged) Jer 
4n A AS N 


Now, let us determine the flux of Poynting’s vector in a conductor 
segment of length J. This flux is equal to 


K X 2arl =i m V, 
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where V is the volume of the conductor segment. But E is simply the 


thermal energy released per unit volume of conductor. We have thus 
shown that the flux of the Poynting vector enters the conductor and 
transfers to it an amount of energy that is exactly equal to the energy 
expended in heat. 

Where does this flux come from? In exactly the same manner as 
above, it can be shown that the flux of energy comes from those por- 
tions of the conductor where applied forces are present. 

This picture explains how electromagnetic energy is propagated 
along conductors. When electric power is transmitted from Kuiby- 
shev to a consumer in Moscow, the energy is delivered by electro- 
magnetic waves and not by the first electrons initiating the motion 
along the conductor. 


Examples. 1. Let us determine the order of magnitude of the electromotive 
orce produced in the antenna of a radio receiver located at a distance R = 
= 100 km from a transmitter whose power is P = 100 kw = 10% joules/sec. 

The numerical value of Poynting’s vector at the location of the receiving 
antenna is 


r P 105 . 2 
K= ——~___ —§ x {0-7 joule/metres? sec (watt/metre?). 
re Tx (10%) x 10-7 joule/ (watt/ J 


_ In the Gaussian system of units, the Æ and H vectors have the same dimen- 
sions (gm’/2 em—1/2 sec). It can be shown that for an electromagnetic wave 
Propagating in vacuum the numerical values of the £ and H vectors in the 

aussian system of units are equal: E’=H’. For these quantities, the following 
Telationships exist between practical and Gaussian units: 


L volt/metre=—+ x 10-4 statvolt/em; 


1 amp/metre = 4x x 10-8 oersted. 


Then the numerical values of the Z and H vectors in the practical system 


of units are: 


Pox 10-46’; H=40 X 10-3H". 


= H’), we obtain: E = 1207H. 
2 


Therefore, for an electromagnetic wave (E’ 
and £=//1200K = 


In the practical system K = EH; hence, K= 0x 
517 x 10-2 volt/metre. ay 
This means a difference produced in a receiving antenna 


havin i £20 my 
a lengt 4 tre is of the order o: V. 

Cosi An obtained above for K with the value of the solar 

constant, i.e., the energy that would arrive from the Sun each second on 1 cm? 


of the Earth’s surface if there were no losses-in the atmosphere: 


Ksun =0-15 watt/em? = 4,500 watts/metre?. 
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119. Momentum and Pressure of an Electromagnetic Field 


According to the theory of relativity (see P: 423), matter which 
possesses energy also possesses mass. The relationship between mass 
and energy is given by the equation EZ = me?, where c is the propaga- 
tion velocity of light. As we already know, the energy of an electro- 
magnetic field may be considered to have the following density 
distribution in space: 


1 21 u2 
w= y (eb? + wH?). 


Thus, a unit volume of electromagnetic field possesses a mass of 
w 


ae 
Since moving matter possesses mass, it must also have a momen- 
tum equal to the product of the mass and the velocity of motion. 


We conclude, therefore, that a unit volume of electromagnetic field 
has a momentum 


This expression is appropriately called momentum density. 

As stated earlier (p. 319), since Poynting’s vector has the sig- 
nificance of energy flow, it must be related to the energy density in 
accordance with the formula K = we. Comparing the last two for- 
mulas, we see that the relationship between the momentum density 
and Poynting’s vector is given by the expression g = K/c®, where ¢ 
is the velocity. 

Since a flow of electromagnetic radiation possesses mass and momen- 
tum, it will exert pressure on a surface placed in its path. The mag- 
nitude of this pressure may be expressed in terms of the momentum 


density and may vary depending on whether the surface absorbs or 
reflects the wave energy. Of course, intermediate cases are also pos- 
sible. 


In the time At, the electromagnetic field -included in a volume 
ScAt strikes the surface S. If total absorption occurs, a momentum 
equal to gScAt is lost in this time. But momentum divided by time 
is force, and force divided by area is pressure. Hence, the pressure 
exerted on the surface absorbing electromagnetic energy is equal tO 
P = gc, the product of the momentum density and the velocity of 


= ‘ w A 
light, or, since g = Ta the pressure is equal to the energy density W- 


Now, let us consider an ideal elastic encounter 
and the surface. If all the energy of the electrom 
is reflected, the change in momentum will be 


between the field 
agnetic field (wave) 
twice the incident 
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momentum, for the latter has reversed its direction. Just as in the 
purely mechanical cases (p. 68), the force of an elastic impact is 
twice as large as the force of an inelastic impact. Hence, the pressure 
exerted by the wave on an ideally reflecting plate is 


p=2ge or p= 2w. 


The formula for the general case is now easily obtained. If the 
plate reflects part of the energy and the coefficient of reflection is 
equal to p, the pressure of the electromagnetic flux (wave) is given 
by the expression 

p=w(1—p)-+2pw=(1 + p)w. 


_ Using light, P. N. Lebedev verified these formulas experimentally 
in 1900 and thus greatly contributed toward the development of our 
Present conception of the nature of electromagnetic waves. The pres- 
sure of light is exceedingly small even for the most intense sources. 
Por example, the pressure of light on a mirror located at a distance 
of 1 metre from a “lamp” of 4 million candle power is of the order 
of 10-4 dyne/em*. That is why Lebedev’s measurement of the pressure 
of light with an accuracy of 1-2% is viewed as a great experi- 
mental achievement. 

Basically, Lebedev’s apparatus consisted of a pair of vanes at- 
tached to a light-weight suspension. One vane was an excellent absorb- 
er of light and the other an excellent reflector. The light was directed 
first at one vane and then at the other, and, then, by comparing the 
displacement angles, it was possible to determine the magnitude of 
the force. The chief difficulty was how to take into account the ef- 
fect onthe vanes of residual gas heating in the vessel containing the 
Suspension. ws 

As we have just seen, the theory of variable electromagnetic fields 
has led to the conception of a field as a physical reality (electromag- 
netic radiation). The great merit of Lebedev’s experiments is that 
they provided direct proof of the validity of this conception. 

An electromagnetic field possesses energy and momentum, is 
Propagated in space with a specific velocity and exerts pressure on 
an obstacle, We shall see below (P- 560) that an electromagnetic 
Held may be transformed into matter. All these facts taken together 


irrefutably prove that an electromagnetic field is a physical reality. 
4 
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CHAPTER XVIII 


ELECTROMAGNETIC RADIATION 


120. Elementary Dipole 


Electromagnetic radiation occurs whenever a variable electromag- 
netic field is created in space. An electromagnetic field, in turn, 
varies in time whenever the distribution of electric charge in a system 
changes or the density of an electric current varies. 

Thus, every variable current and pulsating electric charge is 
a source of electromagnetic radiation. 

Magnetic and electric dipoles having variable moments—particu- 
larly the latter—are the simplest systems producing electromagnetic 
fields. A system consisting of a stationary positive charge about 
which a negative charge oscillates constitutes such an electric dipole. 
If the oscillation is sinusoidal, the dipole moment will also be sinus- 
oidal, i.e., it is represented by the formula p = p cos wt. This 
simple radiator model has very great significance since many real 
seus can be represented to a high degree of accuracy by ideal 

ipoles. 

It will be recalled (see Sec. 93) that the electric properties of 
a system whose “centres of gravity” of positive and negative charge 
do not coincide may be described in terms of the dipole moment of 
the system. But most radiators of electromagnetic energy are elec- 
trically neutral systems whose positive and negative charges are 
capable of being displaced relative to each other. This is primarily 
because atomic and molecular systems fall under this heading. An 
electron rotating about the nucleus of an atom is a system having 
a variable dipole moment, and a neutral molecule whose atoms are 
in a state of oscillation is also, frequently, a system having a variab- 
le dipole moment. Howeyer, our interest in the electric dipole extends 
further. In the following article, we shall see that a linear radio anten- 
na may be likened to a dipole. (Incidentally, the analogous terms 
“oscillator” and “vibrator” are somewhat broader in meaning than 
the exact term “dipole”.) 

Magnetic dipoles occur when the electric charge distribution and 
hence the dipole moment of the system remain unchanged while the 
current density and hence the magnetic moment of the system change 
with time. A typical example is a loop in which an alternating elec- 
tric current flows. If the current flows in a closed circuit. the electric 
charge is neither accumulated nor dissipated anywhere. ‘The electric 
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dipole moment of such a loop equals zero and does not change. 
However, the loop’s magnetic field, which is related to the value 
of its magnetic moment, varies and, therefore, electromagnetic 
energy is radiated. It follows from the theory that if a system pos- 
sesses simultaneously an electric and a magnetic moment the radia- 
tion from the magnetic dipole at large distances from the source is 
usually much less than the radiation from the electric dipole. 

If a dipole radiates by giving up internal energy or, as in the case 
of an antenna, by transforming the energy of an external source 
into radiation energy, the dipole is called a primary radiator. How- 
ever, a secondary radiator is also of considerable interest. In this 
case, a dipole is made to oscillate by the action of an electromagnet- 
ic wave and becomes a radiator only as a consequence of this action. 
Secondary oscillations are particularly intense when the primary 
wave is of the same frequency as the natural frequency of the dipole 
(resonance). 

y Setting a dipole into an oscillatory state may be viewed as a mechan- 
ical process—the jostling of the charges by an external force equal 
to the product of the charge and the field intensity. At the same time, 
the process of creating secondary oscillations in a receiving antenna 
may be viewed as an induction process in which an alternating 
electric current is produced by an alternating magnetic field. To the 
extent that the antenna may be replaced by a dipole, both views 


are equivalent. 
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An important difference exists between the state of oscillation of 


an oscillatory circuit (p. 313) and the oscillation of the current in an 


antenna. In discussing the electric oscillation of a circuit, we referred 
and a definite instan- 


to a definite instantaneous current strength 
taneous charge on the condenser plates. It was assumed that the cur- 
rent strength in all parts of the circuit was the same, that the elec- 
tric charge was concentrated on the condenser plates and, hence, at 
a given instant could only have a single value. ; : 
_ The electric oscillation in the case of an antenna cannot. be viewed 
in the same manner as the oscillation of a pendulum. However, the 
Oscillation of an electric current in an antenna does have a mechan- 
ical analogue. This oscillation is very similar to the vibration of 
a rod or string, i.e., it can be represented by a standing wave. 
This may be strikingly demonstrated by showing that an excited 
antenna has current nodes and antinodes. A small bulb may serve 
as the current indicator (Fig. 132). It turns out that a conduction 
current antinode exists at the centre of a free section of line in which 


electromagnetic waves are excited and that conduction current 
24* 


"ae 
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anodes exist at the ends. In such a line, the current at all points is 
unidirectional at each instant. At some instant the current at each 
point decreases to zero and then begins to flow in the opposite direc- 
tion. The electric charge, which is distributed continuously along 
the line, varies accordingly. Clearly, as long as current flows in one 
direction, positive charge is accumulated on one half of the line 
and negative charge is formed on the other. When the current decreases 
to zero, the charges at the ends are a maximum and of opposite 
sign. The current then begins to flow in the opposite direction and 
the charges decrease, becoming zero when the current strength in all 


- es 
p $ 
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Fig. 132 


parts of the line is a maximum. At this instant, the recharging proc- 
ess commences, charges of opposite sign are accumulated on the 
two halves of the line, etc. 

The reader will note that at each instant charge 
are located on the two halves of the line. Two charges 


hat are equal 
in magnitude but opposite in sign, and separated by acertain distance, 
constitute an electric dipole, 


It can he stated, the fore, that the 
electric oscillations of an antenna are very similar o the oscilla- 
tions of an electric dipole in w moment decreases 


from a maximum positive value to zero, then increases in the oppo- 
site direction, then again decreases, etc. 
The field of an antenna differs from that of a dipole only in the 


tances hundreds of times greater 
ı the field created by the antenna 


ated by an ideal electric dipole. 


alogy between an antenna and a rod. 
ectric oscillation 


opposite sign 


hich the dipole 
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on the length of the antenna, although such half-wavelength di- 
poles are mainly used in UHF engineering. The following relationship 
exists between the length of the antenna and the wavelength: 


À 5 
L =n (see p. 131). Thus, an antenna of length Z can receive and 


radiate waves of wavelength A satisfying the above relationship. 

In the field of radio, a number of methods exist for varying the 
natural frequencies of an antenna. Basically, they consist in the 
connection of a self-inductance coil or a condenser to the antenna. 
By varying the inductance or capacitance, the natural frequencies 
of the antenna may be varied within broad limits. 


122. Radiation Pattern of a Dipole 


The radiation pattern of a dipole may be determined experimental- 
ly. It turns out that the results are in complete accord with the theo- 
ry first advanced by Hertz. We shall be concerned only with the 
results of experiments and theoretical calculations, restricting our- 
selves to the field far removed from the dipole, i.e., to the so-called 
wave zone. This is the region in which the distances to the dipole 
are considerably greater than the dipole dimensions. 

Irrespective of the complexity of dipole oscillations, the oscilla- 
tions may always be resolved by means of Fourier’s theorem into 
their spectra, i.e., they may be represented as the sum of harmon- 
ic oscillations of frequencies œ, 2%, 3a, etc. Therefore, it is quite 
sufficient to consider the electromagnetic field of a dipole whose 
moment varies in accordance with the harmonic relation p = 
= Po cos wl. 

Calculations and experiments show that the field of such a system 
may be represented by a spherical wave propagating with a velocity 

= Va The electric and magnetic vectors of the wave are at 

eu 
right annie to each other and also at right angles to the direction of 
Propagation. The latter circumstance, incidentally, follows from 

Oynting’s theorem. 

In the wave zone, the electric and magnetic vectors vary in phase, 
performing harmonic oscillations at every point in space. A simple 
relationship exists between the numerical values of the field inten- 


Sity vectors, namely: 


V:E=V pe. 
Hence, Poynting’s vector may be written in the following form: 
€ p2 eB 4 
Kms AV hte aa 
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Thus, the wave intensity, i.e., the energy passing through unit 
area per unit time, is proportional to the amplitude squared of the 
electric field intensity. 

The radiation of a dipole is not the same in all directions. The 
amplitude, as well as the intensity, depends on the angle of inclina- 
tion of the propagation direction to the axis of the dipole. In the 
direction perpendicular to the dipole axis the radiation is a maxi- 
mum, while in the direction of the dipole moment it is equal to 
zero. Theory gives us the following expression for the electric field 
intensity: 


poo? s =) 
E= zzp Sin 8 cos œ (= alia 


where the factor before the cosine is the w 
E. The angle 0 is the angle between the propagation direction 
and the dipole axis. The expression for the 
magnetic field deviates from the above 
only with respect to a slight difference in 
the amplitude factor. 
Fig. 133 shows a diagram sometimes used 
to represent the dependence of radiation 


intensity on direction, Here, a radius vec- 
0 


ave amplitude of vector 


Al2 


Fig. 133 


tor is shown intersecting 
the radiation intensity is given by the 
ured to the point of intersection. 
The fact that the amplitude is 
quency} squared is very im 
of the dipole depends ver 
Portional to the fre 


the radiation pattern. If the scale is known, 


length of the vector meas- 


Proportional to the radiation fre- 
portant. Clearly, the radiation intensity 
y greatly on the frequency, i.e., it is pro- 
quency to the fourth power: : 


r 4 
i K~ JF sin? 0. 
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= 3 : 1 
Thus, when the frequency is halved, the intensity decreases to 75 


of its original value. 
Theory has led to an important conclusion regarding the orthogo- 
nality of an electromagnetic wave. This is illustrated in Fig. 134, 
where it is seen that the electric and magnetic vectors are perpendicu- 
lar to the direction of propagation. Asa result, the properties of an 
electromagnetic wave change when the wave is turned about the 
direction of propagation. This phenomenon is known as polarisation. 
: Mapping the flux lines of a radiat- 
ing dipole is of no particular inter- a 
est. Fig. 135 shows vectors of elec- / 
tric field intensity for several points 
in space. The field is rotational and 
the flux lines are closed. When radia- 
tion occurs, the close lines expand in 
the direction away from the radiator. 
The magnetic lines of force consist of 
circles about the dipole axis. At great 
distances from the dipole, the spheri- 
cal wave practically does not differ 
from a plane wave. Of course, the 
orientation of the #, H and K vectors 
in a plane wave and the numerical 
relations given above remain the same. 


N 
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According to theory, electromagnetic a 
radiation occurs when electric charges Fig. 135 


are accelerated nonuniformly. A uni- ford 
form or free flow of locke charge does not produce radiation. 
harges moving under the action of a constant force, e.g., charges 
describing a circle in a magnetic field, also do not radiate. 
Tn oscillatory motion, the ‘acceleration is continuously changing. 
lence, electric charge oscillations produce electromagnetic radia- 
tion. Electromagnetic radiation also occurs when charges are abrupt- 


ly decelerated. Thus, when a beam of note mediation Ae 
get, X-rays are produced. Electromagnetic rad1atio Bhar, 
during prio Pemi motion of particles (the at WEEN dni). 
he pulsations of a nuclear charge produce $ tromagnety ; 
radiation known as y-tays. Ultraviolet rays a Svisiblevlight ake s 
Produced by the motion of atomic electrons. Elfcfri¢ charge oscilla} 
tion on a cosmic scale is exemplified by the rad{ntyo of radidywaves f 


by heavenly bodies. G GRET / 


EN 
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In addition to natural processes in which various kinds of electro- 
magnetic radiation are produced, a num ber of experimental means 
exist for creating electromagnetic radiation. aoe. 

The main characteristic of electromagnetic radiation is its fre- 
quency (in the case of a harmonic oscillation) or its frequency band. 
Of course, using the relation ¢ = vA, the length of the electromag- 
netic wave in vacuum may be determined if the radiation frequency 
is known. 

The radiation intensity is proportion 
fourth power. Hence, very low-frequency radiation having wave- 
lengths of the order of hundreds of kilometres cannot be traced. 
Practically, the radio band begins with wavelengths of the order 
of 4 or 2 km, which corresponds to frequencies of the order of 150 ke/s. 
Wavelengths of the order of 200 metres are in the medium-frequency 
band, while those of the order of tens of metres are in the short- 
wave band. Ultrahigh frequencies (UHF) are beyond the usual radio 
frequencies; wavelengths of the order of several metres and fractions 
of a metre, down to a centimetre (i.e., frequencies of the order of 
1019-1011 me/s), are used in the television field and in radar. 

In 1924, Glagolyeva-Arkadyeva obtained ev 
netic wayes. Electric sparks produced betweer 
ed in oil served as her so 
0.1 mm were obtained. TI 
radiation wavelengths. 

Visible light occupies a very small band 
7.6 X 10-5 cm to 4 x 10-5 cm. This band is followed by ultraviolet 
rays, which are invisible but very easily detected by means of labo- 
ratory equipment. These wavelengths extend from 4 x 10-5 cm to 

>> cm. 

Following the ultraviolet band is the X-r 
in this band extend from 10-5 cm to 10-10 


wavelength, the less the absorption by matter, Electromagnetic 
radiations of shortest wavelength, which are the most penetrating, 
are called y-rays (wavelengths of 10-9 em and Jess) 

The nature of any of the enumerated electro 
may be completely determined as follows: First, the electromagnetic 
radiation is resolved into a Spectrum by some method or other. In 
the case of light, ultraviolet rays and infrared radiation, this may be 
accomplished by refraction through a prism or by passing the radia- 
tion through a diffraction grating (see below). In the case of X-rays 
and y-rays, resolution into a Spectrum is achieved by reflection from 
a crystal (see p. 385). The spectrum of radio waves is determined by 
making use of the phenomenon o 


f resonance. 
a a $ 
The radiation spectrum obtained may be continuous or discrete, 
i.e., all frequencies may be present in 


a broad band of the radia- 


al to the frequency to the 


en shorter electromag- 
n iron fillings suspend- 
urce. In this manner, wavelengths down to 
ius, an overlap was achieved with thermal 


wavelengths— from 


ay band. The wavelengths 
cm. The shorter the X-ray 


magnetic radiations 
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tion spectrum or the spectrum may consist of individual sharp lines 
corresponding to very narrow bands of frequency. In the first case 
the spectrum is represented by a curve of intensity vs. frequency (or 
wavelength), while in the second case the spectrum is described by 
giving the frequency and intensity of the lines. ; 

Experiments show that electromagnetic radiation of given frequen- 
cy and intensity may not always have the same polarisation state. 
In addition to radiation in which the electric vector of the waves 
oscillates along a specific line (linearly polarised waves), radiation 
also exists in which linearly polarised waves turned relative to each 
other about the beam axes are superimposed. Hence, to completely 
describe radiation, it is also necessary to indicate its polarisation. 

_ It should be noted that even for the slowest electromagnetic oscilla- 
tions, the electric and magnetic vectors of a wave cannot be meas- 
ured. The above descriptions of a field are based on theory. Neverthe- 
less, in view of the continuity and unity of all electromagnetic 
theory, there is no reason to doubt their veracity. 

The assertion that one or another kind of radiation consists of 
electromagnetic waves is always based on indirect evidence. How- 
ever, since these hypotheses have so many consequences that are in 
complete mutual agreement, the electromagnetic spectrum hy- 
Pothesis long ago became accepted fact. 


124. Quantum Nature of Radiation 
156) that investigation of atomic 


Phenomena has led to a law which states that the internal energy 
of a system cannot assume any arbitrary value, but is characterised 
instead by a system of energy levels. Energy radiation is related 
to the transition of a system from a higher level to a lower level. 
“nergy absorption is related to the transition to a higher level. 
This applies, in the first place, to electromagnetic radiation. The 
quantum nature of submicroscopic phenomena was discovered at the 
eginning of this century as a result of investigation of a number of 
Conflicting facts regarding electromagnetic radiation. 
hus, the emission of electromagnetic radiation of frequency v 
Y a dipole occurs in quanta (packets) rather than continuously. 
A quantum of energy is equal to hv, where is Planck’s constant and 


1S equal to 6.62 x 10-7 erg sec. 3 : . 
he quantum nature of electromagnetic waves 1S manifested in 
absorption as well as radiation, for absorption too can only occur 
Y means of energy quanta. If the value of an energy quantum is 
equal to the difference between certain energy levels of a system on 
Which the wave impinges, the absorption process 1S quite pronounced. 
uch a process may be called resonance absorption. From the stand- 


We have already noted (p. 


we. 
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point of classical physics, such absorption occurs when the frequency 
of the external field is equal to the oscillation frequency of the par- 
ticles constituting the system. If the value of the electromagnetic 
wave quantum is less than the difference between energy levels, 
absorption cannot occur and the wave passes freely through the 
system. r . bed 

In quantum terms, the secondary radiation of a system is describe 
as follows: A system absorbs a quantum of electromagnetic. energy 
and is raised to a higher energy level. The system maintains this 
level for a certain period of time and then returns to its former 
energy level by giving up energy—again in the form of a quantum. 

Since a quantum of energy is equal to hy, it is immediately evident 
that the higher the radiation frequency, the more pronounced the 
quantum phenomena. Nevertheless, the quantum nature of radiation 
has already been observed in practically all regions of the electro- 
magnetic spectrum. It has even been possible to observe the quantum 
absorption of radio waves having wavelengths of several hundred 
metres (radiospectroscopy—p. 549). 

The presence of one or another spectrum of electromagnetic radia- 
tion depends, in the first place, on the arrangement of 
in the system under consideration and on the transiti 
ties of the system from an n-th level to an m-th level. I 
bilities were known beforehand and the energy level 
available, it would be an easy task 
trum of the system. 


We shall repeatedly be dealing with problems of radiation and 
absorption of electromagnetic energy, but now let us consider some 
problems of electromagnetic wave propagation in which the quantum 
nature of radiation is not manifested when the phenomenon is not 
accompanied by the absorption and radiation of energy. 


energy levels 
on probabili- 
f these proba- 
diagram were 
to determine the radiation spec- 


CHAPTER XIX 


PROPAGATION OF ELECTROMAGNETIC WAVES 


125. Dispersion and Absorption 


„In a homogeneous medium, the velocity of propagation and the 
direction of an electromagnetic wave do not change. The velocity 
of the wave is a maximum in vacuum. In a medium, the wave velo- 
city is 

c 
Vam 


= 


and since in most practical cases p = 1, we obtain 
i c 
Ve ` 


The ratio of the velocity of wave propagation in vacuum to the 
velocity of propagation in a medium is called the index of refraction. 
Thus, from electromagnetic theory we obtain the equation n = Ve, 
which is quite valid for very long wavelengths. As the wavelength 
changes, the index of refraction changes. This dispersion is alien to 

Taxwell’s electromagnetic theory, which regards a medium as a con- 
tinuum and does not take into account the interaction of radia- 
tion and matter. Be that as it may, the equation n = Ve is not 
valid for rapid electromagnetic! oscillations. 

Vhen an electromagnetic wave is propagated through matter, the 
electric charges of the molecules are set into a vibratory state. Since 
an electron cloud moves freely as compared with heavy nuclei, 
electric oscillation consists in the displacement of the centre of 
Sravity of the electrons relative to the stationary centre of gravity 
of the atomic nuclei’s positive charges. Designating the charge and 
Mass of the oscillating electrons by e and m, respectively, the oscil- 
ation equation may be written in the form 


T 


ma = —ka—eHy cos ot. 
Or, dividing by m and using the formula for the natural frequency of 
Oscillation, i.e., œ? = ue we obtain 


f. D 
= — o?r —— Hy cos ot. 
r= — 0t — p 2o 
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We have equated the product of mass and acceleration to two ae 
the restoring force—kx and the external, periodically varying fore 
eH, cos wt. This is the equation of forced harmonic oscillations- 
It is satisfied if 


T = To COs wl. 


After substituting in the equation, we obtain 


The polarisation vector, i.e. 


, the dipole moment per unit volume, is 
N times larger, where V is tl 


he number of molecules per unit volume: 
Ne2 


Recalling the formul 


a relating the Polarisation to the field inten- 
sity, i.e., 


`» 


it is seen that the 


permittivity of the medium has be 
in terms of the par, 


en expressed 
ameters of the molecular dipole: 


A4nN e2 
m 
c= é 
zi OF — w2 
_ The index of refraction of the medium should be equal to the square 
root of this expression, 


re of the dependence is confirmed 
by experiment. Here, the index of r i 
based on the aboy 


substances, * What is the basic conclu- 


experimental and theoretical results? In 
general, the index of refraction increases with increasing frequency 
in the entire frequency range, except for the region in the immediate 
Vicinity of resonance absorption. This region is called the anomalous 


* A more exact theory results in a 


j 0 greement w 
in the region close to @ as well. 


ith the experimental values 
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dispersion region. A substance may have several resonance frequen- 
cies, which correspond to the differences between its energy levels. 
Hence, there will be a corresponding number of anomalous disper- 
sion regions. 

Thus, the index of refraction of a wave, and hence the velocity 
of propagation, greatly depends on the value of the wave frequen- 
€y relative to the natural 


frequencies of the molecular i; 
dipoles. z / ——- Theor 
Naturally, the capacity I y 


— Experiment 


of a substance to absorb an 
electromagnetic wave de- 
pends on the same factors. 
Using the same reasoning 

as in the case of elastic 1 
Waves (see p. 120), we arrive 
alba completely analogous 
formula: 


Teen tan 


which enables us to deter- 
mine the valueof the radia- 
tion intensity J relative to wy 4 
the incident intensity Zo if 
the absortion coefficient u 
and the layer thickness d 
through which the wave has passed are known. It should be recalled 
that the absorption coefficient is equal to the reciprocal of the layer 


thickness which decreases the radiation intensity to — of its original 


Value. Due to the complex system of energy levels peculiar to mat- 
ter, the curve of the absorption coefficient plotted against the 
"equency of the incident wave may appear odd and “erratic”. — 
nlil now we have been considering dielectric media containing 

nly bound electric charges. Other relations exist when an electro- 
magnetic wave is propagated in a medium in which a considerable 
Number of free electrons are present. Such media include metals and 
le ionosphere—a region of free charges akin to a gas. Using the 
‘eory presented above, it must be assumed that in the formula for 
e the natural frequency ao of a free charge is equal to zero (the fre- 
quency is proportional to the rigidity of the bond). The dielectric 
“onstant is then given by the formula 
4aNe? 

m 


Sa: 


w? 


Fig. 136 


e=1— 
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When @ becomes sufficiently large, the index of refraction n= Ve 
approaches unity. But when œ? < 4aNe?/m, the index of refraction 
is imaginary. This means that for the given values of frequency the 
waves cannot penetrate the metal or the ionosphere. On the other 
hand, for high frequencies the waves are “indifferent” to the presence 
of a medium containing electrons. These predictions are borne out in 
the case of radio waves. Thus, long and medium waves are reflected 
from the ionosphere and do not penetrate it, short waves penetrate 
the ionosphere, and uhf waves pass through the ionosphere unim- 
peded. 

The above presentation is greatly oversimplified. Hence, it should 
not be surprising that the conclusions are not valid for the optical 
region where the values of the index of refraction may be close to 
zero and also much greater than unity. 


126. Behaviour of an Electromagnetic Wave at 
the Boundary Between Two Media 


Just as in the case of an elastic wave, an electromagnetic wave is 
reflected and refracted at the boundary between two media. The basic 


E k laws of these phenomena 
may be subjected to theo- 
retical analysis by utilising 
4 the boundary conditions om 
the electromagnetic field 
JESN. m ot 

RaT — vectors. These conditions, 

eflected Incident S E 
wave wave discussed on pp. 259 and 
291, follow in turn from 
Maxwell’s equations. Since 
the relationships between 


the fields on either side of 
the boundary are not arbitrary, the division of the wave into reflect- 


ed and transmitted components is also not arbitrary. 

The two fundamental relations may be expressed as follows: the 
tangential components of the electric and magnetic vectors on either 
side of.a boundary must be equal. 

What restrictions are imposed by these relations in the simple 
case of normal incidence? This case is illustrated in Fig. 1437. Assum- 
ing that the electric vectors are in the plane of the page, then the 
magnetic vectors are perpendicular to this plane. We know that the 
electric and magnetic vectors and the direction of propagation may 
be viewed as a right-handed screw system, i.e., the rotation of 
vector Æ by the shortest path toward vector H appears counterclock- 
wise to one facing the oncoming wave. To satis 


C ng we fy this requirement of 
electromagnetic theory, the direction of either vector H or vector E 


Fig. 137 


| 
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must be reversed for the reflected wave. Thus, either the magnetic 
or the electric vector undergoes a 180° phase shift when the wave 
is reflected. 

When considering oblique incidence, it is necessary to determine 
which of the two actually occurs. It turns out that both are possi-. 
ble—one when the wave passes into a medium of larger € and the 
other when it passes into a medium of smaller e. 

For normal incidence, the following calculations do not depend on 
the scheme chosen. Let us write the boundary conditions in the form 


Eincia — Eyeptect + Evepract 
and 
Hincia= — Hrejtect +H retract- 
But the following relationship exists between. the numerical values 
of the H and E vectors: 
H=V£E=nE. 


Hence, we obtain two equations, 
Eincia == Ereftect SF Evejract 
and 
NE incia = — ME ye jtect + NE yejract) 


whence the ratios Ereftect/ E incid and Erejract/Eincia MAY be deter- 
mined. Since the wave intensity is proportional to the amplitude 
Squared and the index of refraction (p. 326), we obtain for the coeffi- 
cients of reflection and transmission the following simple formulas, 


where n= 2 is the relative index: 
ny 
Exeflect Ji T OE 


coefficient of reflection = ( Braa) T QFIE 
S, Brafract. Ne ån 
coefficient of transmission = (Fp) EE 


f elastic waves (cf. p. 128) is very great. 
the general results presented below 
beam inclination and polari- 
t with experimental results is 


The similarity to the case 0 

Y means of such calculations, 
Were obtained for the case of arbitrary 
Sation state of the wave. The agreemen 


Tuite satisfactory. SA F : 
ince the sum of the reflection and transmission coefficients is 


equal to unity, the theoretical results are completely described by 
rad 138, where the intensity of the reflected wave is plotted as a 
unction of the angle of incidence. 

i alculations and experiments show 
urve depends to a significant extent 0 


that the nature of the reflection 
n the polarisation state of the 
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incident wave relative to the plane of incidence. The electric field 
intensity vector Æ is “more important” than the H vector, if only in 
the sense that the photochemical action is due to Æ. Therefore, in 
describing the polarisation state of a wave, it is customary to specify 
it with respect to the electric vector. The orientation of the H vector 
is always easily found if the direction of propagation is known. Thus, 
it turns out that the reflection coefficient is different for two waves 


>) . that are incident at one and the 
i same angle @ on the same boundary 
j if in one case the electric vector 
isin the plane of incidence and in 
the other it is perpendicular to 
this plane. In the figure, curve I 
corresponds to the case when the Æ 
vector is perpendicular to the plane 
of incidence, curve ZII corresponds 
to the case when the Æ vector is in 
the plane of incidence and curve Z 
corresponds to the case when the 
wave is not polarised. i 
In the first case, the change in 
the reflection coefficient is monoton- 
ic—for normal incidence there is 
little reflection, the coefficient 
being of the order of 5%; then, 
with increasing angle the reflection 
coefficient increases, ever more rap- 
idly, until the glancing angle is 
reached. A beam whose electric vector is in the plane of incidence 
behaves in an entirely different manner. Its reflection intensity 
decreases with increasing angle until it reaches zero at the angle @p- 
This angle is determined by the following interesting equation: 
n = tan Pp. The figure is plotted for the value n — 1.52 (transition 
from air to glass). Hence, the angle at which the reflection coefficient 


decreases to zero is equal to 56°40’. For a further increase in angle, 
the coefficient of reflection begins to increase and finally reaches 
unity. 


What is the reason for the absence of reflection in this particular 
case? How does this case differ from others? Evidently, the answer 
must be sought in the boundary conditions, from which the entire 
theory of the phenor 


nenon proceeds. We leave it to the reader to 
construct the field vectors for this angle and illustrate the require- 
ment. 


The following question may arise in the reader's mind: If the 
boundary conditions enable us to understand all Phenomena at the 


08 


04 


Fig. 138 
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boundary between two media, then what about total internal reflec- 
tion where there is a field in one medium (£,; = 0) but none in the 
other? The question is perfectly valid and the theory provides an 
answer. It turns out that under conditions of total internal reflec- 
tion the field penetrates the second medium but is not propagated 
deeply into the medium. The condition Ey, = E» is not violated. 

A number of experiments have been devised for the demonstration 
of light wave penetration into a second medium under conditions of 
total internal reflection. Suffice it to recall the basically simple 
experiment proposed by Mandelshtam. A glass prism is partially 
immersed in a solution of fluorescein—a substance exhibiting 
a characteristic fluorescence under the action of light. Then, a beam 
of light is directed onto the prism is such a manner that total reflec- 
tion occurs on the inner side of the prism surface that is immersed 
in the solution. The fluorescein thereupon glows intensely in an 
extremely thin layer next to the glass, proving that the electromag- 
netic wave has penetrated the solution. 


127. Natural and Polarised Light. Polarisation 
Upon Reflection 


Place a glass plate at an angle Pp to a light beam. The beam is 
reflected. Then, if the beam is turned about its axis (actually, the 
Source of light is turned about the beam axis), it might be expected 
that at some position the beam will not be reflected. But if natural 
light is used for the experiment, this does not occur, 1.e., for every 
azimuthal position of the incident beam, the reflected beam has the 
Same intensity. It would be wrong to consider this a refutation of the 
theory presented in the preceding article. This experiment merely 
Shows that the polarised state of a beam of natural light is more 
Complex than given by the scheme of two vectors, # and H, having 
fixed directions of oscillation. Re 

Now, let the above beam, reflected at an angle p;, impinge on a 
Second plate placed at a similar angle ps to the beam reflected from 
the first plate. Then, turn the beanr about its axis. Since, of course, 
only the relative position of the beam and the reflector is of impor- 

ance, it is easier to turn the second glass plate. Investigation of this 
double reflection shows that the reflection varies with the position, 
and the position for which no reflection occurs 1s easily found. It is 
evident that this position corresponds to a mutual orientation of 
eam and reflector for which the electric vector of the beam is in 
the plane of incidence. The following conclusion may be drawn: 
reflection from the first reflector results in the natural beam acquir- 
ing a polarised state in which a single oscillation direction of the 
electric vector is separated out. 
221409 
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In contradistinction to natural beams, beams in which the vectors 
have a specific oscillation direction are referred to as polarised ae. 
How should the polarised state of a natural beam be envisaged? 
It is necessary to assume that in a natural electromagnetic wave all 
possible oscillation directions of the electric vector are uniformly 
present. The word “possible” should be underlined since electromag- 
netic theory shows that the electric vector is of a transverse nature. 
In essence, therefore, a natural unpolarised wave is a superposition 
of numerous linearly polarised waves having a uniform distribution 
as regards the vector oscillation directions. All transverse directions 
are electric vector oscillation directions of a beam of natural light. 

Reflection from two successive reflectors, fixed at an angle pp to 
the beams, is one method of polarising beams of light. 

An electric vector of natural light may always be resolved into two 
mutually perpendicular components. When reflection is being inves- 
tigated, it is most convenient to resolve each vector into two com- 
ponents—one in the plane of incidence and the other perpendicular 
to this plane. Thus, the behaviour of a natural beam may be equated 
to the behaviour of two such component waves, if we take into ac- 
count that the phase difference between them varies randomly. There- 
fore, in describing the polarisation of light, we say that one of the 
components has not been transmitted, or has been transmitted, to 
such and such an extent. If upon reflection or refraction one of the 
components of light is transmitted to a larger extent than the other— 
which the reflection curves show to be the case—this signifies that 
the light has been partially polarised. 

We can utilise this phenomenon to obtain total polarisation of a 
beam. Instead of using two reflectors fixed at an angle p to the 
beams, it is much easier to transmit a beam through a pack of glass 
plates. Each refraction will increase the share of one of the compo- 
nents in the beam by a certain percentage. In this manner almost 
total polarisation may be achieved. 

The natural state of a light beam is unpolarised. However, this 
does not mean that every beam that has not been subjected to rellec- 
tion.or refraction is unpolarised. This applies particularly to radio 
waves. The short electromagnetic waves used in the transmission of 
television are highly polarised. It is precisely this circumstance that 
enables us to determine the direction of the transmitter by the orien- 
tation of the receiving antenna. The electromagnetic waves which act 
as the carrier of a television programme are highly polarised. Hence, 
the antenna must be oriented in such a manner that the oscillation 
direction of the electric vector coincides with the antenna direction. 


128. Propagation of Light Waves in a Medium Having a Gradient 339 


128. Propagation of Light Waves in a Medium Having 
a Refractive Index Gradient 


As a rule, a difference in density is associated with a difference in 
refractive index. The natural question arises: What is the nature of 
the wave propagation in a medium in which the value of the refrac- 
tive index varies from point to point, i.e., the refractive index gradi- 
ent differs from zero? 

A difference in refractive index signifies a difference in the veloci- 
ty with which the wave front advances. It follows, therefore, that 
the wave front will be continuously deformed as it advances in such 
a medium. If we construct the normals to the wave front, we obtain 
a curved line. Thus, it can be stated that in a nonhomogeneous medi- 
um light is propagated curvilinearly rather than rectilinearly. 

The analogous problem for sound waves was discussed earlier 
(p. 147). The same laws are applicable here and the beam path is 
also determined by Fermat’s principle. A beam of light propagated 
in a finite medium having a refractive index gradient follows a path 

etween two points that requires a minimum amount of time to 
traverse. Therefore, the beam of light bends so as to shorten its path 
in regions where the refractive index is large and lengthen its path 
in regions where the refractive index is small. + 

The best example of the propagation of light in a medium of gradi- 
ent z is the passage of a beam of light through the Earth’s atmosphere. 
Since the density and the index of refraction of air decrease with 
increasing elevation, it follows that refraction occurs in the atmos- 
Phere. A beam travelling from a star to the Earth and entering the 
atmosphere at an angle rather than along a radius will bend; hence, 
the apparent position of the star is displaced relative to its true posi- 
tion. For a star at the zenith, the displacement angle is as much as 
72 of a degree. rte , 

_ Mirages are caused by the presence of a refractive index gradient 
im the atmosphere. They occur in the African deserts due to the fact 
that heat currents are easily formed above the hot sand, resulting in 
temperature gradients and, hence, density and refractive index gra- 
lents. As a result, the light beams travel along curved lines and 
a landscape seems to appear where the observer, accustomed to the 
rectilinear propagation of light, conceives it. 

f course, in the case of light propagation in a nonhomogeneous 
Medium, the waves are neither spherical nor planar. It should be 
recalled that a variable propagation velocity signifies that the wave- 
€ngth also varies from point to point. What is the equation of wave 
Motion in a medium where the refractive index changes from point 
© point? Since the parameters of the wave change from point to 

22* 
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point. the equation being sought must be a dilferential equation, for 

only a differential equation can express the dependence between the 

physical quantities for a given point in space. s 
This equation may be found by means of Maxwell’s equations. 

Since the derivation is rather complex, it will not be presented here. 

The calculations yield the following relation, which is valid for the 

E vector (or its projection) as well as the H vector (or its projection): 

ay 1 ay 


ðs 2 ate * 


The function ¥ is called the wave function. It represents the Æ 
vector, the H vector, or their components since the equations are the 
same for each case. Here, s is the coordinate in the direction of wave 
propagation, ¢ is the time and v is the propagation velocity. 

This equation is called the wave equation and it is valid for all 
points in space located outside the sources of the field, i.e., outside 
charged regions and regions in which electric currents flow. 

First it will be shown that the above differential equation is satis- 
fied by the simplest wave process, i.e., a plane wave. As we know 


(p. 114), the expression for a plane wave of frequency w, propagating 
in the direction s, has the form < 


E Y= Acoso (1 —=) k 


Let us determine the second derivative of the wave function WY with 
respect to time and with respect to the coordinate. We obtain 


aw aay o2 
Uae 


It is seen that the followin 


g relationship must exist between the 
second derivatives: 


ow 1 ory 


gs2 T D op? 


hence, the equation of a plane 


c h K wave is embodied in the proposed 
differential equation. Howey. 


er, the above differential equation em- 


braces much more. Any function of argument (=>) is a solution 


of the equation since for any function Y (—<) the derivative 
à, v 

expressions in ¥ are the same. 

The dependence of a function on the argument 


as the sole indication of a wave 
argument consists in the followin 
is characterised at the instant of ti 


(—=ż) is regarded 
process. The significance of this 
g: If the state at a point s = 

met = 0 by a certain value of the 
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wave function, then the same state occurs at point sı at the instant 
So 


of time host, at point sə at the instant of time t,=—, etc. Here 


s is the coordinate reading along any rectilinear or curvilinear path. 
The differential equation 
Cs Tal ay 
ðs? v2 ĝt? 


is the general equation of a wave process and is valid,for any medi- 
um, including nonhomogeneous medium where v varies from point 
to point. 

If it is necessary to express the wave function in terms of the three 
space coordinates x, y, z, the generalised wave equation has the 
following form: 

aw aay ae 1 aw 
Ox? Jy oy? ar a2 v? Ot? 


` The sum of the second partial derivatives of a function is concisely 
designated by the symbol AW (read: Laplacian of ¥). Thus, 


1 ey 
Wm? 


The differential equation of a wave is valid for any process in 
which the value of the wavelength and the amplitude of the wave 
vary from point to point. a 

Let us designate the amplitude of the wave function ¥ by 1p. For 
most problems, it is primarily 1p that interests us. If a vibratory 
Process of frequency © occurs in a region, then, in the most general 
Case: 

DES oan 

at? 
Therefore, a wave function will always satisfy the equation 
Np 


r W that is a function of time always 
nce, the last equation is the equation 


ye it ma 
= o y 


The part of the expression fo 
Cancels in such an equation. He 


for the wave amplitude 1p. By means of the relation A 


also be written in the form 


Sometimes this equation too is called the wave equation. 
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129. Propagation of Radio Waves 


Radio waves are propagated in accordance with the laws of reflec- 
tion and refraction. In order to obtain the concrete results required 
for our discussion, it is merely necessary to generalise the theory to 
the case of a medium having a continuously varying coefficient of 
refraction. But this has already been done for elastic waves (see 
p. 145) and is completely applicable for electromagnetic waves, 
i.e., for light as well as radio waves. A wave travelling in a medium 
of variable n, i.e., a wave travelling with variable velocity, is pro- 
pagated in such a manner that the least amount of time is taken to 
traverse the distance between two points. The path of the wave will 
be curvilinear and in passing from one layer of the medium to another 
layer where n is greater the wave will be deflected toward the normal 
to the boundary. 

In order to determine the nature of the radio 
the electrical properties of the Earth and the 
known. The electromagnetic field of the w 
the magnitudes of the electrical con 
constant of these two media. 


How is the difference in behaviour of electromagnetic waves of 


different length explained? Of course, a significant role is played by 
dispersion. But an approximate indic 


electromagnetic wave may be obt: 
relationship between the displ 
current. It is evident that a r 
when the displacement cu 


-wave propagation, 
atmosphere must be 
ave is greatly affected by 
ductivity and the dielectric 


, if the displacement current is negli- 
considered totbe a conductor. 

nd the properties of the 
standpoint. 


I perience in the field of radio 
engineering has shown that a flat terrain covered with trees may be 


characterised by a dielectric constant € of the order of 12 and a spe- 
vity y of 7 x 107 (in the Gaussian system). 

i aves over the surface of a sea, it is 
and y for sea water. These values 

e ratio of the conduction current 


nt density (see p. 319 for the re- 
e formula 


i long waves, say 2,000 metres, 
this ratio is equal to 77 for a woode 


d area and to 1,600 for a sea 
surface. The medium in each case, but especially in the latter, may 
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be considered to be a good conductor. For short waves, say 20 me- 
tres, the first value decreases to 0.77 and the second to 16. This means 
that for short waves sea water continues to be basically a conducting 
medium, while a wooded area acts to a considerable extent as a 
dielectric. 

Waves propagating over a conducting surface “cling” to this sur- 
face. The electric flux lines approach the Earth at right angles and 
travel along the terrestrial surface. That is why an electromagnetic 
wave can easily travel around the globe. (It takes 0.13 sec to do this 


Fig. 139 


and since this time can be measured quite accurately we can deter- 
mine the propagation velocity of radio waves.) This applies to long 
waves. Short waves cling only to sea surfaces. Tn other regions, they 
behave like perfectly free waves. When such a wave travels along 
the surface of the globe, it penetrates the Earth and is absorbed. 
Moreover, the higher the oscillation frequencies, the greater the 
absorption. 3 4 

A number of remarkable features in the behaviour of radio aver 
is explained by the presence in the upper layers of the atmos piers ° 
a layer containing a large number of free ions and electrons. his 
layer is known as the ionosphere. Thus, the region in which an elec- 
tromagnetic wave travels may be roughly pictured as a dielectric 

ounded by two conducting layers. ; y 

EREN. of the atmosphere is not uniform, i.e., the number of 
free charges per unit volume varies from one layer to the next. As 
was seen in Sec. 125, the coefficient of refraction decreases as the 
Number of charges increases. Since the coefficient of aaa of 
a conducting medium is less than unity, a wave entering the iono- 
Sphere at an angle from a dielectric medium is deflected from the 


normal. The ionisation increases; hence, the deflection increases 


with each succeeding layer. her 
Furthermore, as Fig: 139 shows, a wave may either pass through 


the ionosphere and recede from the Earth or, after being bent more 
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and more, return to the Earth. 
_ the nonuniformity of the ionosphe 

if it strikes the ionos 

reflection angle: 


Roughly speaking—disregarding 
re—a wave returns to the Earth 
phere at an angle greater than the total internal 


F } 4.N2e 
sin 0) >n= V1 Saal 
For smaller angles, the wave is 


propagated into outer space. By 
repeatedly being reflected from t 


he ionosphere and the terrestrial 
surface, short waves can round the globe, experiencing considerably 
less energy losses than in the case of long waves. 

Since uhf waves can pass through a layer of free charges, they are 
not reflected from the ionosphere. Therefore, radio reception on uhf 
is possible only along the line of sight. 

e of the atmosphere is 
shown that the densit 
atmosphere is charact 


greatly oversimplified. 
y distribution of free 


, depending on the tir 


al properties of the ionosphere and. 
engineers have drawn a number of 
e conditions for radio 

tious lengths. However, 
we shall not go into this subject. 


130. Radar 


A radar station consists of transmitting and re 


ceiving equipment. 
Every ten-thousandth of a second (A in Fig. 14 ea 


), the transmitter 


ale Lee eee 


sends a pulse of duration 
into space. If an object capa 
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wave is reflected back to the radar station. The reflected signal is 
2: 
received r= sec after the pulse is transmitted into space. 


This time may be measured by means of an oscilloscope. The 
sweep of the electron beam is synchronised with the transmitted 
pulses, and the demodulated signal from the receiver is fed to the 
second pair of oscilloscope plates. As a result, a “pip” displaced 
relative to the initial point of the sweep by a distance proportional to 
the time t appears on the oscilloscope screen. If the object intercepted 
by the radar pulse is stationary, the “pip” on the oscilloscope screen 
will also be stationary. Thus, synchronisation is achieved when the 
sweep time equals one ten-thousandth of a second, the interval of 
time between successive transmitted pulses. If the object “seen” by 
the radar is moving, then the pip on the oscilloscope screen also moves. 

Modern radar systems are much more complex than indicated by 
this simplified picture. The motion of the electron beam of an oscil- 
loscope from the centre to the edge of the screen is more complex 
than motion along a radius. While moving outward along the radius, 
the electron beam slowly rotates about the centre of the screen like 
the hand of a clock. This rotation is synchronised with the rotation 
of the radar antenna in such a manner that the illuminated line 
Points in the same direction as the transmitted radio beam. In addi- 
tion, the following important change in the operation of the oscillo- 
. Scope is introduced: If the radio beam does not encounter an obsta- 
cle, and hence the receiver does not pick up a reflected signal, the 
oscilloscope screen remains dark. On the other hand, if a pulse is 
received, a spot on the screen is illuminated. : 

Thus, when a beam scans the horizon and encounters a body, this 

ody is indicated on the oscilloscope screen by an illuminated spot. 

le distance of this spot from the centre of the screen is propontidmn 
one distance of the radar from the object, and the azimuthal angle 
Mdicates the direction of the object. 3 ; 
scilloscope arer possess an afterglow; hence, m illuminated 
Spot does not disappear while the radar is scanning the anai T 
efore returning to the same position. If the illuminated spo 1e E 
to the beam reflected from a fixed object, the image on the osci T 
Scope screen is also fixed. If the object moves, a moving Image wii - 


aPpear on the screen. . : 
Due to the difference in the reflection coefficients of vanes ob- 
J€cts, a characteristic picture of the region is depicted on the screen 


; ros 5 ing. River: lakes appear 
a radar system with circular scanning. Rivers and i 
dark (little AOA the Earth appears lighter and woods still 
ighter. Of course, metal objects are “seen” very clearly. 

The nature of the visibility varies in accordance with the wave- 
lengths used. Thus, for radio waves in the centimetre range, clouds 
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are seen very clearly. Since longer waves are insensitive to cani: 
and rain, radar systems operating on such wavelengths can be usec 
in all kinds of weather if they are not intended for the specific pur- 
t detecting clouds. za ; 
ae principles find broad application in science and engineering: 
Thanks to radar, pilots experience no difficulty in conducting nig at 
flights and in landing on airports which are not illuminated. Radar 
is of great importance in meteorology. In addition to enabling us to 
detect rain and storm clouds at great distances or at night, which 


Transmission Reception 
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Antenna change- 
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switel 
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Fig. 144 


casting, radar can be used to track mete- 


is essential for weather-fore 
orological balloons. Radar equipment installed on ships is a great 
aid to navigation safety, reducing to nil the 
collisions of a vessel with other 
of astronomy, radar methods are 
meteors, and the direction and velocit 
reflected, in the main, from the mete 
ionised gases. The Mo 
reach of radar. R 


and not physics, 


the principles of this remarkable dey 
Fig. 141 shows a block diagram of 


CHAPTER XX 


INTERFERENCE PHENOMENA 


131. Addition of Waves from Two Sources 


First, let us consider two ideal sources radiating spherical waves. 
Assume both sources are oscillating synchronously. In this case, for 
waves of any type, a characteristic field is created in which bright 

_ and dark “fringes” appear where the waves reinforce and cancel each 
other, respectively. This phenomenon is most 
easily demonstrated by means of water waves. 

„ The mathematical calculations are straight- 
forward. Take any point a distance rı from. 
one wave source and rz from another. Then, 
Maximum reinforcement of the waves occurs 
when the path difference r4 — T2 equals a whole 
number of wavelengths, nd. On the other hand, 
the waves annul each other when the path 
ifference equals an odd number of half- 


wavelengths (22+ 1) >. 


We know from analytical geometry that a 
curved surface all of whose points satisfy the 
following condition is a hyperboloid: the 
difference between the distances to two foci is 
a Constant. In Fig. 59 (p. 123) aplaneisdrawn ` 
through the wave sources. Shown in this plane are hyperbolas— 
-loci for which the difference between the distances to the wave 


Sources is a constant. 
Now, let us consider the pa 
in indrical screen whose axis pa 
„Fig. 142. (We shall assume 


Fig. 142 


ttern obtained in a wave field on 
sses through the radiators as shown 
throughout that the radiators are 
ll consist of alternating 
the conditions are exactly the 


Same for all points on the cylinder located 
Telative to the radiation sources all such points ] 

bright band will appear along a line around the middle of the 
Ylinder; since the distance from both sources is the same, the waves 
reinforce each other. For points at a height z above the mid-line, 
the difference in the paths traversed by the rays, ™ — Tz, will be 


are in the same state. 
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rir? 
represented by Ee 
between the sources. 


has the form 


. But r?—r;=2lz, where l is the distance’ 
"Thus, the condition for the n-th bright fringe 


Be =n. 
Tarish2: 


If the radiators are far from the screen, then for fringes close to 
the centre, 


ry +r. = 2k, 
where R is the radius of the cylinder. The bright bands pass through 
the points z satisfying the condition 
E =nÀ. 
The distance between adjacent fringes is Az = i 


(see Sec. 132) “separated by a distance 


à = 6,000 A, then the distance between 
on the surface of a cylinder of radius R = 1 metre is Az = 


Example. If two coherent sources 
l = 1 mm radiate light of wavelength 
interference fringes 


E mm. 


If the light source radiates waves of various wavelengths, the 
interference pattern will be colou: 


red since the maximum conditions 
differ for different values of A. 


In addition to determining the positions of maximum and mini- 
mum interference, it is also of interest to determine the form of the 


f intensity Gury acro e fringes. 
Since a is the path difference 
i between the waves, then 
T 
E a R 


is the phase difference, and the 
total amplitude at any point is 
given by 

Acos ot + A cos (wt +6). 


For equal amplitudes, we obtain 
the expression derived on p. 103: 


ò ô 
Fig. 143 2A cos > cos (ot Efe 5) 5 


; s The measured intensity (wave 
amplitude squared) is equal to the average value of this expres- 
sion taken over the oscillation period. - 


Since 
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(see the following article regarding calculation of the average), then 


alz 


= 242 cos? 
T= 2A? cos IR 


The intensity curve may be plotted as a function of the vertical 
coordinate z (Fig. 143). 
132. Coherence 


The superposition of two waves, described in the preceding section, 
may be physically achieved by various means and for various wave- 
lengths. For example, two antennas radiating radio waves may be 
placed close together, two electric bulbs with point filaments may be 
placed close together, or an incident ray and the same ray reflected 
from a mirror may be brought together. Experiments show that by no 
means does interference occur in all cases. This phenomenon can best 
be investigated by examining the superposition of the fields of two 
antennas. It is easily shown that an interference pattern is obtained 
only when a phase difference that remains constant during the time 
of observation exists between the superimposed waves. In this case 
the oscillations are said to be coherent. 

If the phase difference is fixed, the amplitude of the electromagne- 
tic oscillations at a given point in space is constant. Thus, a maxi- 
mum point remains a maximum and a point where the waves comple- 
tely cancel each other always has zero intensity. : 

When the phase difference varies randomly, the pattern is comple- 
tely different. During a certain interval of time the amplitude of 
the oscillations at a given point is a maximum, in the succeeding 
interval it assumes intermediate values, and then for an interval of 
time the waves cancel each other. If the duration of these intervals 
were commensurable with the practical capabilities of instruments, 
a fluctuating interference pattern could be detected. If the variations 
in phase difference are so rapid as to preclude detection by these 
instruments, the interference pattern is not revealed and the average 
value of the intensity is shown on the instruments. In such cases, 
We say that the oscillations are noncoherent. > 

What is the expression for the average intensity in a region where 


elds are superimposed? This is easily determined. ; 
The amplitude of the total wave at a given point and at a given 


instant may be expressed in the form 

‘A, cos wt + Ap CoS (at +ô). 
The instantaneous intensity is proportional to this expression 
Squared, i.e., it is equal to 


A? cos? wt + A, cos? (ot + 8) + 2A,Ap cos ot X cos (ot + ô). 
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We are interested in the time-averaged intensity of the radiation, 
i.e., 


T = A? (cos? @t)av-+ 4; [cos® (0t +8) Jav +2414 [Cos wt X cos (ot+-6)]av- 


The average values of trigonometric quantities are encountered quite onen 
in physics. It is therefore useful to recall that the average values of sin x ang 
cos x are equal to zero, and the average values of sin®x and cos2z are equa 
to 1/2, if the argument z of the trigonometric function assumes all values with 
equal probability. The average value of a function f (x) is by definition 


Moa- LDL ead a EF En) 


n 


This formula can be used to calculate the average if the variable z assumes 
discrete values. But if the variable z is continuous and assumes all values in 
the interval from a to b, the formula for calculating the average value is ob- 
tained in the following manner. Divide the interval (b — a) into n segments of 
length Az. Multiplying the numerator and denominator by Az, we obtain 


A. 9) Az... 
[7 Map Len Az ad bet a 
Going over to the limit, this takes the form 


b 
TO 


By means of this formula, we can calculate the average value of any func- 
tion of a continuously varying random quantity. In calculating the average 
value of a periodic function, a single period should be used in determining the 


limits of integration, for the average value of one period is clearly equal to 
the average value of any number of periods. Thus, for example, 


T 
1 1 
[cos? zJay= >= $ cos? z dz = Tr: 
0 


Let us write the formula for the intensity in the form 
T= Aj (cos? Wf) gy + A} [cos? (wt + 5)]ay + Ay Ao [cos (204 + 8) lav + 
+ A; Ap (cos §)ay. 
Using our knowledge regarding the average values of cos z and cos? ©; 
we obtain: for a phase difference between two waves varying ran- 
domly, i.e., for noncoherent oscillations, 
T=+ (A+ Ad); 


on the other hand, if the phase difference is fixed, i.e., if the oscilla- 
tions are coherent, 


T= 5 (Ai Ad) + AAs cos 6 
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2 


or, for equal amplitudes, 
T =2A? cos 
a is the same as the interference formula derived in 


neighbouring antennas may be made 


eans. 
t first glance it would seem 


This last formul 
he radiations of 


the preceding article. 
Radio waves radiated by 
coherent or noncoherent by technical m 
S, a 


As regards oscillations of light wave: 
that it is impossible to achieve coherence since t 
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er. The 


| 
| 
| individual atoms have no definite relationship to one anoth 
| Phases of the waves emitted by individual atoms are haphazardly 
distributed. It is quite natural that two light sources, no matter 
mate point sources, do not yield an inter- 
less, coherent light oscillations do exist. 
d the same light wave. 
nt sources are shown in 


Low closely they approxi 
erence pattern. Neverthe 
hey occur for rays “taken” from one an 
‘Methods of artificially securing coherent | s ar ; 
ig. 144. In one case two mirrors that are slightly inclined relative 
the other a double prism (biprism) is 


© each other are used and in 
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used to produce wave sources from two virtual centres. Interference 
fringes may be observed on a screen placed anywhere in the inter- 
ference field. The theoretical discussion presented above is entirely 
applicable to these cases; the nature of the fringes is determined 
by the distance between the virtual images of the light source and 
the distances from these images to the point of observation. 

It is quite clear why the two parts of the “divided” ray are coher- 
ent. Between any pair of atoms of a real source, there is no coherent 


relationship. However, by divid- 
ing such a ray into two parts, 
we enable the radiation of each 
atom to interfere with itself. 
Thus, consideration of various 
cases of light} interference is re- 
duced to the investigation of va- 
rious cases of division ofa light ray 
by reflection and refraction, and 
the successive superposition of the 
components of the divided wavein 
the region of the interference field. 
The size of the light source 
significantly affects the coherence 
of the “divided” 
ource is of length b 
the solid angle 
2u take part in the creation of the 
such as Z and /’ emanate from 


1\, 12 1 


I e due to the Superposition of the fields 
of rays 1, 2 and 7 , 2’. For interference „to occur, the fields of the 
coherent beams 1,1 and 2,2’ must reinforce each other, The path 
difference A = b sin u, which exists between Z’ and 2 as well as 
between 7 and 2, tends to prevent this. Moreover, interference be- 
comes possible only when b sin ig le 


133. Interference in a Plate 


Let us investigate reflection and refr ti i incident on 
a flat plate of thickness d (Fig. 146). ion of light inciden 


ssume a plane waye impinges on the lat i. The 
beam of light is reflected and refr Ae E a 
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in the second medium. All these rays are coherent and a phase differ- 
ence exists between them. Hence, the conditions are present for 
interference in the reflected as well as in the transmitted rays. 

As is well known (see p. 336), the reflection coefficient is not very 
large—at least, for normal incidence. In this case, the intensity of 
each “succeeding” ray is much less than the intensity of the preceding 
one. For example, for a reflection coefficient of 5%, the first reflected 
ray has an intensity of 
0.05 To. The second reflect- 
ed ray undergoes two re- 
[fractions and one reflection. 
[ts intensity is-0.95 x 0.95 x 
x 0.057% = 0.04579. Thus, 
the intensities of the first 
two rays are practically the 
Same. But the third ray is 
much weaker since it un- 
dergoes three reflections and 
‘Wo refractions. Its intensity 
'Sequal to 0.95 x 0.95 x 
Xx 0.05 x 0.05 x 0.05 Zo, 
1€., one four-hundredth of the intensity of the preceding ray. 

nder conditions of small reflection coefficient, the phenomenon 
reduces to the observation of the interference of the first two rays. 

. AS regards the transmitted rays, under conditions of small reflec- 

tion Coefficient the interference is not noticeable since the intensity 
or the second ray is one four-hundredth of the first, the intensity of 
the third is ‘one four-houndredth of the second, etc. However, it is 
not very difficult to set up the experiment in such a manner that 
„D the reflected as well as in the transmitted beam numerous inter- 
erence rays occur. , « 

a monochromatic wave impinges on a flat plate, the interference 
Pattern is determined by the phase difference between the first and 
Second reflected rays. 

‘tom the wave formula 


A cos œ (:—=) , 


it is evident that the phase of the wave, traversing the path x with 


velocity v, changes by ot, or x, where Vis the wavelength in the 


Medium. Designating the wavelength in vacuum by A» and recalling 


that the refractive index n is equal to 4s , the change in phase may 


be Written in the form P ne. The product nz is often called the opti- 
0 


23-4 409 


= 
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cal path of the wave. If a wave passes through several media, its 
25 1 ! 7 {= 

phase changes by S, where S = (mx; + not. + . . .) is the opti 
Lo 


cal path. à ' , , 

The phase difference ô between the interfering waves, which deter- 
mines the intensity of the resultant wave, is given by 

2; 
5=4 A, 

e., it is determined by the difference between S$’ and S”, the optical 
paths of these waves: A = S’ — S”. , FA. 

Referring to Fig. 146, let us calculate A for the case which interests 
us. It is most convenient to express A in terms of the refraction angle 
r, the plate thickness d and the index of refraction n. As scen from 
the figure, 


A=2dnecosr. 


However, in addition, it is necessary to take into account the phase 
jump occurring upon reflection (cf. pp. 335-36). In this respect, the 
first and second rays differ, for the first is reflected from the external 
surface of the plate, while the second is reflected from the internal 
surface. Therefore, the electric vector of one of the rays undergoes 


a 180° phase jump and the other does not. Thus, the resultant phase 
difference is 
2x 
ô= 2dn cosr + x. 
a . 

Maximum interference occurs when ô = m27, where m is a whole 
number; minimum interference occurs when 6 = mx. Therefore, 

2 ones i Ao = 

maximum condition: 2dncosr=mAg + = 


minimum condition: 2dn cos r = mhg. 


Thus, depending on the value of à, n, d and r, interference may 
cause the intensity of a wave reflected from a plate to be zero or 
a maximum. In an ideal experiment with a monochromatic beam, 
by varying the angle of incidence, for example, a reflected ray 
should alternately vanish and reappear. In an analogous experi- 


ment with a beam of white light, the plates should pass through all 
the colours of the rainbow in succession. 


134. Fringes Representing Equal Thickness and 
Fringes Representing Equal Inclination 


Several factors enter into the extremum condition 2dn cos r = MÀ. 
Hence, if they are varied simultaneously, a confused picture may 


result. The effect is clearest when all the parameters, except one, 
may be considered fixed. 
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If a plate has a variable thickness d, a constant refractive index 
and a practically constant angle of incidence (and hence angle of 
refraction) for the portion of the plate under consideration, the inter- 
ference will be observed in the form of fringes representing equal 
thickness. All parts of the plate having the same thickness d will be 
subject to the same conditions. Therefore, on an uneven plate, there 
appears a system of bright and 
dark fringes (or rainbowed in case 
of white light). These lines con- Pes 
nect points where the thickness 
of the plate is the same. This 
explains the coloured fringes often 
seen on oily films spread on the 
surface of water. If a plate is 
Wedge-shaped the fringes repre- 
Senting equal thickness consist 
of straight lines. Such fringes 
may be easily observed on soap y 
films. In the case of a vertical film, the soap trickles down | and 
the film becomes thinner in the upper region; horizontal fringes 
appear on the film. i 
When light impinges normally on a plate, cos r ~ 4 and fringes 
appear on the plate where the thickness d satisfies the relation 
n = mho. x 
The difference in the thicknesses of the plate represented by adja- 
cent fringes equals i a i.e., a half-wavelength. Thans brighe 
fringes representing equal thickness indicate nonuniformities in 
Plate thickness of the order of a tenth of a micron. : 
If the thickness from one point to another varies very slowly, the 
ringes may turn out to be very far apart. Thus, for example, in. 
a dripping soap film a wedge may form having a 0.5-minute angle; 
in this case, as can be easily calculated with the aid of Fig. 147, the 
fringes will be 2 mm apart. 

fa wedge decreases to zero thich 
Pears dark in the reflected light since thicknesses less than z 


nja 


Fig. 147 


kness, the end of the wedge ap- 
do not 
reflect light. The first bright fringe occurs for the thickness d= F 
he return path must also 
bright fringe occurs for 
rmined by simply count- 


(the Path difference is twice as great since tl 
q_ included in the calculation). The next 
i =A, etc. Thus, the thickness may be dete 
ng the fringes. 
The question naturally aris 
Wie kness easily observed on t 
indowpane? The answer is t 


es: Why are fringes representing equal 
hin films but not, for example, on a 
hat it is not possible to create the 


23* 
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ideal conditions whereby the single variable quantity is the plate 
i ss d. : 
Cope consider the effect of a spread in the angle of incidence 
(refraction). If the angles vary from 7, to rs in such a manner that 
on the interference maximum for r, there is superimposed the extinc- 
tion for rz, the interference fringes will be smeared. What is the 
value of the angular interval Ar = T2 — r; that smears the pattern 


of fringes representing equal thickness? The value can be determined 
from the conditions: 


4 
2dncosr,=md and 2dn cos r, = (m -+ z) A, 


whence 


2dn (còs ry — cos ra) = 


tol > 


For simplicity, let us restrict ourselves to the case of normal inci- 
dence; let r equal zero and r, a small value Ar. Then, 


2dn=mh and 2dn (1 —*) = (m+) A. 


Hence, 


and since 2dn = mÀ, 
3 (Area. 


If the plate is thin, the values of m are measured in units or tens of 
units. In this Case, an angular spread of about a tenth of a radian, 
i.e., 5°-10°, does not smear the pattern, However, for a plate 4 mm 
thick, m is already of the order of 5,000. Here, an angular spread 
of the order of gnly a hundredth of a r 


| adian suffices to prevent obser- 
vation of fringes representing equal thickness, 


However, even if the geometry is ideal, relatively thick plates do 
not yield an interference pattern. This is due to the fact that real 
light is not ideally monochromatic. An ideal Wave is an infinite 
sinusoid. But an atom radiates for a very short period of time. 
Experiments show that a real wave is 
length is of the order of ten to a hun 
interference to occur, the path difference 
of the sinusoidal waye train; ence does not exist 
between the Superimposed waves. If the length of the sinusoidal 


Wave train is equal to 100 mm, the path difference corresponds to 
a value of m of the order of 150,000. : 


e must not exceed the length 


134. Fringes Representing Equal Thickness and Equal Inclination 357 


Now, let us consider fringes of another kind, namely, fringes 
representing equal inclination. Such fringes may be observed when 
a beam of light with a continuous spectrum of incident angles im- 
pinges on a plate having parallel surfaces, i.e., a plate for which 
d is the same at all points (Fig. 148). 

Consider a beam of reflected rays contained in a given solid angle. 
Let us direct our attention to those rays lying along generating lines 
of a cone whose axis is normal to the plate. All rays lying on such 
a cone have the same value of r and yield lines representing equal 
inclination. 

The differences in the method of observation of lines representing 
equal thickness and lines representing equal inclination should be 


Fig. 148 


noted. Since lines representing equal inclination occur at infinity, 
a lens must be placed in the path of the rays to make them visible. 
This enables us to observe curves representing equal inclination in 
the focal plane of the lens. For normal incidence, lines representing 
equal thickness may be observed with the naked eye on the surface 
of a wedge. If the light impinges on such a plate at an angle, lines 
representing equal thickness may be observed on the surface of 
a wedge only in the case of very thin films. Otherwise, the interfer- 
ence pattern is observed in two planes located above and below 


i i . D . 
the wedge at a distance d= , where & is the wedge angle. To derive 
SL 


this formula — which we leave to the reader—plot a ray incident on 
the surface of the wedge at an angle i and the rays reflected from the 
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upper and lower surfaces. The observation plane of the interference 
pattern will pass through the point where the prolongations of the 
two reflected rays intersect. Note that the above formula is only 
valid for the case of an air wedge. 


135. Practical Applications of Interference 


Interference methods are widely used for the measurement of 
small distances and small changes in distances. They enable us to 
detect thickness changes of less than one-hundredth the wavelength 
of light. An accuracy of 10-7 em may be achieved in the measure- 
ment of the unevenness of a crystal surface by interference methods. 


Fig. 149 


Many applications are based on the use of curves representing 
equal thickness. This method is widely used in the optical industry. 
For example, in order to check the quality of the surface of a glass 
plate, an air wedge is created between the plate under test and a 
standard plate having an ideal flat surface and the fringes repre- 
senting equal thickness are examined. The air wedge is formed by 
pressingethe two plates together along one edge. If both surfaces 
are flat, the lines representing. equal thickness will be parallel 
straight lines. 

; Let us assume that the surface of the plate under test has a depres- 
sion or a bump. The lines representing equal thickness will then be 
distorted, i.e., they will bend around the defective area. If the angle 
of incidence of the light is varied, the fringes are displaced in one 
direction or the other, depending on whether the defect is a depres- 
sion or a bump. The patterns seen under a microscope in these cases 
are shown in Fig. 149. The first two pictures are those of defective 
samples. In the first the defect is located on the far right, while in 
the second the defect is on the left. The third picture is that of a 
sample without defects. 
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This method may also be used for very accurate measurement of 
the coefficient of expansion. For this purpose, an air gap must be 
created between the surface of the object under test and a perfectly 
flat surface. As the object expands, the thickness of the air layer 
changes and the fringes representing equal thickness begin to move. 
If a line is displaced to such an extent that the next one takes its 
place, the thickness of the air layer at this location has changed 


by 2 If, as is usually the case, the measurement is performed using 


monochromatic light, the fringes are very sharply delineated and the 
displacement of a line by 
one-hundredth of the dis- 
tance between lines may 
be measured. 

Accurate measurement 
of the refractive index of 
a substance may be per- 
formed by means_ of an 
interference refractome- 
ter. In such an instru- 
ment, the interference 
between two light rays 
that are separated as 
much as possible is ob- 
Served (Fig. 150). For this Fig. 150 
Purpose, a thick plate is 
used and a convenient 
angle of incidence selected ( 


for ordinary glass, the best angle is 
about 50°). The rays travelling between the plates are separated 
and the substance being tested is placed in the path of one of them. 

his changes the optical path of this ray and hence the path differ- 
ence between the interfering rays. If the interferometer plates are 
exactly alike and perfectly parallel, the interfering rays cover the 
same distance and reinforce each other. If the plates are inclined 
With respect to each other, a path difference is created and the clar- 
ity of the field being observed is reduced. : 

Such is the situation for a beam of perfectly parallel rays. But if 
4 slightly divergent beam impinges on the plate, a system of fringes 
representing equal inclination is seen through the eyepiece. In this 
Case, the variations in the optical path difference are conveniently 

flermined by counting the interference fringes passing the cross 


air of the instrument. tee ’ 
et us seco ait a body of length / and refractive index n is 
Placed in the path of one of the rays. If the refractive index of the 


dium is mp, the optical path difference changes by A = J (n — No)- 
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Therefore, 4 fringes should pass through the eyepiece of the instru- 


ment. The accuracy of this method may be easily gauged from the 
fact that a displacement of one-tenth of the distance between lines 
is easily detected. For such a displacement, A = 0.1} = 0.5 X 
x 10- em, which for a length ¿ = 10 cm enables us to distinguish 
a change of 0.5 x 10- in the refractive index. 

The well-known Michelson interferometer (Fig. 151) is used for 
the accurate measurement of length as well as for the determination 
of the velocity of light (see 
p. 417). In this instrument, 
a parallel beam of mono- 
chromatic light impinges 
on a glass plate having pa- 
rallel surfaces, one of which 


l ` 
; (> E is covered with a trans- 
Ji - lucent layer of silver. This 
plate is placed at a 45° angle 
to the incident beam. As a 
result, the beam is divided 
Ws into two parts. One part 
: moves parallel to the pro- ’ 
Fig. 151 


longation of the incident 

beam and the other is di- 
rected perpendicular to the incident ‘beam (to the left). These 
rays impinge normally on two mirrors and return to the same points 
on the translucent plate from which they came. Each ray returning 
from the mirror is repeatedly divided at the plate. Part of the light 
returns to the source and the other part enters the telescope to the 
right. As a result, two coherent interfering rays appear in the field 
of the telescope. It is seen from the figure that after the first division 
at the lightly silvered surface the ray coming from the mirror opposi- 
te the telescope passes through the half-silvered plate twice. There- 
fore, in order to provide equal optical paths, the ray coming from 
the other mirror is passed through an equalising plate identical 
to the first plate, but without the translucent layer of silver. 

In the field of the telescope there appear lines representing equal 
inclination (rings), Corresponding to interference in an air plate whose 
thickness is the difference in the distances of the mirrors from the 
translucent layer. The displacement of one of the mirrors by a quar- 
ter of a wavelength corresponds to the transition from a maximum to 
a minimum, i.e., it results in the displacement of the pattern by 
“half a ring”. Such a change may be easily detected by an observer. 


Thus, the sensitivity of an interferometer using rays of violet light 
is better than 1,000 A, i.e., 0.4 micron. 


at 
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An ‘interesting application of the Michelson interferometer prin- 
ciple is the microinterferometer developed by the Soviet physicist 
V. P. Linnik. In this instrument, one of the interferometer mirrors 
is replaced by an object to be investigated. Lines representing equal 
thickness are observed in the image plane. Dimension details of 
0.1 micron yield a sharp transition from maximum to minimum 
illumination. A microinterferometer is generally constructed in the 
form of an adapter toa usual microscope and screws into the drawtube 


in place of the objective. 
Of great importance in sci- 
ence are interference mi- 
croscopes, which in principle 
are similar to the micro- 
interferometer. (In the mi- 
Croscope, interference does 
not occur in the image 
plane, but occurs rather in 
front of the usual objective, 
i.e., the interference image 
has the same dimensions as 
the microobject.) A double- 
ray interference microscope 
does not yield a gain in 
magnification. The advan- 
tage of this method lies else- 
where. For example, we 
often encounter objects of 
microscopic investigation 
that are either entirely f 
transparent or vary little in transparency from one point to another. 
Before this method was developed, the details of such objects could 
be made visible only by dyeing (generally speaking, different struc- 
tural elements absorb dye differently). But dyeing has little appli- 
cation in the investigation of living microorganisms. Fig. 152, which 
shows a microphotograph of frog’s blood magnified 300 times, illus- 
trates the possibilities of the interference microscope. The drawbacks 


of this method include considerable loss of light in the interference 
Mechanism ard complexity of the microscope s optical system. 

further increase in the sensitivity of interference methods and, 
therefore, a further advance in the field of interference microscopy 1s 
Possible by going over from a double-ray interferometer to a multi- 
Yay interferometer. In double-ray interferometers, the screen illu- 
mination is proportional to 4 + cos kh, where h is the displacement 
along the screen. Due to the smoothness of the transition from maxi- 
mum to minimum illumination, a small displacement of the inter- 


Fig. 152 
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ference fringes is difficult to detect. In multi-ray interferometers, 
the situation is considerably improved. As an example, let us 
consider the Fabry and Perot interferometer. A Fabry and Perot 
interferometer (Fig. 153) consists of two rather thick glass or quartz 
plates partially silvered on their adjacent sides. These surfaces are 
perfectly parallel to each other and the region between the plates 
contains air. When a beam of light enters the interferometer, the 
beam is divided into a transmitted and reflected component every time it 
impinges on one of the layers 
of silver. Thus, in the trans- 
mitted as well as in the 
reflected light, there is ob- 
tained a set of coherent 
light beams whose intensi- 
ties decrease in geometric 
progression (see p. 352) and 
whose phases are displaced 
in arithmetic progression. 
In order to secure interfer- 
ence of a large number of rays, the decrease in amplitude during suc- 
cessive reflections must not be large. This condition is met when the 
silver coating on the plates has a reflection coefficient of 0.9 or more. 
The intensities of the rays in the transmitted light are then quite 
low, but there will be little variation from ray to ray. This makes it 
possible for a large number of rays (as many as 10-15) to take part 
in the formation of each illumination maximum. 


The interference pattern is obtained in the form of the usu 
representing equal inclination, 


al rings 
but with one very important differ- 
ence, namely, the principal maxima determined by the condition 
2dn cos œa = mh are now much narrower and their intensities are 
tens of times greater than the intensity of the background between 
them. Therefore, the interference pattern assumes the form of very 
narrow bright fringes separated by broad dark intervals. The dis- 
placement of such a narrow maximum may be determined much more 
accurately than the displacement of a fringe in a double-ray inter- 
ferometer. An analogous narrowing of maxima occurs when the 
number of slits in a diffraction grating is increased. 

Thus, multi-ray instruments sharply increase the sensitivity of 
interference methods. Such systems are indispensable in the investi- 
gation of the vertical structure of an object surface in reflected light. 
The magnification of details in the vertical direction may reach 
a value of 400,000, which makes it possible to reliably resolve details 
of the order of 5-10 A. ‘This is only several times greater than the 
distance between atoms! An example of such photography is the 
picture of the spiral growth of a crystal shown on p. 640. 


CHAPTER XXI 
SCATTERING 


136. Secondary Radiation 


Under the action of an electromagnetic wave, every molecule 
becomes a secondary radiator of electromagnetic waves. Due to the 
electric force, the electron cloud is displaced relative to the atomic 
nuclei and the molecule acquires a dipole moment varying in time as 
the frequency of the incident wave. The behaviour of such a mole- 
cule differs in no way from the behaviour of the elementary dipole 
discussed in Chapter XX. The intensity of the secondary wave is 
given by the formula derived on p. 326 (intensity ~ or sin? 0) and 
the spatial intensity distribution of the secondary radiation is shown 
in Fig. 133. 

In a number of cases, which will be discussed below, the phenome- 
non of secondary radiation leads to various phenomena of electro- 
Magnetic wave scattering. By scattering, incidentally, we generally 
mean any electromagnetic wave propagation phenomenon that is 
not included under refraction, reflection and rectilinear propagation. 

The intensity formula given above is valid for any electromagnetic 
wave. However, the fact that the intensity increases sharply with 
radiation frequency explains “why the effects of wave scattering by 
a molecule are not detectable when the wavelengths are very long. 
The scattering intensity of visible light is quite sufficient to produce 
Significant effects. 

Light wavelengths are hundreds 


than the dimensions of ordinary molecu s 
trons of a molecule are made to vibrate in the same phase by the 


external field. For light waves, ultraviolet rays and even very soft 
A-rays (i.e., X-rays of long wavelength), a molecule behaves like 


an elementary electric dipole. - > P 

The picture changes considerably in the case of X-rays having 
a wavelength of the order of 1 A. Now, the dimensions of the mole- 
cule are larer than a wavelength and different portions of the mole- 
Cule’s electron cloud vibrate in different phases. In order to deter- 
Thine the intensity of the scattered wave, we must take into account 
interference effects occurring between waves scattered by different 


Parts of the molecule. 2 We 7 
In principle, this is not very difficult. First, it is necessary to di- 
Vide the molecule’s electron cloud into small volumes. The dimensions 


and thousands of times greater 
les. Therefore, all the elec- 
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of each such volume, Av, must be much less than a wavelength. 
Then, the electrons in this yolume will scatter in the same phase. 
Designating the density of the electron cloud by p, we obtain pA» 
electrons in a volume Av;,. The amplitude of the secondary wave 
created by the k-thevolume is proportional to pAv,. The amplitudes 
` of the scattered waves are added with due regard to the phase differ- 
ences between the elementary waves and this sum is then squared. 
It is found that the intensity distribution of the scattered wave 
differs significantly from the radiation pattern of a single dipole. 
This is understandable, of course, for there will be directions in 
which elementary waves scattered by diferent volume elements 
reinforce each other, i.e., act in phase, and, on the other hand, 
directions in which elementary waves tend to annul each other. An 
important conclusion to be drawn from such calculations is the fol- 
lowing: When interference occurs between elementary waves ema- 
nating from different volume elements of a particle, waves travelling 
backwards tend to annul each other in the final analysis; on the 
other hand, those travelling forward reinforce each other. 

We have been discussing secondary radiation from a molecule, 
but often the secondary radiator is a much bigger particle, namely, 
one composed of numerous molecules. It may be a particle of dust, 
colloidal substance, fog, crystalline substance,: smoke, or a large 
albuminous molecule, etc. The nature of wave scattering by particles 
is determined by the ratio of their dimensions to the wavelengths 
of the exciting electromagnetic wave. If the particle is small relative 
to the wavelength, the wave is scattered as by a single elementary 


dipole. If this is not the case, interference effects occur and the for- 
ward scattering predominates. 7 te 

Different parts of a particle may possess different scattering powers- 
This is precisely the situation in the case of X-rays scattered by 
a molecule. A particle whose scattering power is the same throughout 
the volume is the simplest body from the standpoint of scattering 
investigations. We shall confine our attention to such a system, 
for not only are the calculations simple in this case, but, in addi- 


tion, a system of this kind is easily simulated experimentally by an 
aperture in an opaque screen. 


137. Wave Diffraction at Apertures 


The amplitude of a wave scattered by a particle is determined by 
the distribution of scattering material in the particle. Particles 
(“apertures”) may be encountered in which the density of the scatter- 
ing material gradually decreases with increasing distance from the 
centre of the atom. On the other hand, more pronounced nonuni- 


137. Wave Diffraction at Apertures 365 


formities may be encountered, e.g., inclusions and pores at whose 
edges the density changes abruptly. 

Peculiar diffraction effects arise when scattering takes place on 

such nonuniformities. With increasing angle the scattering intensity 
first gradually decreases to zero. Then, as the angle increases further, 
the intensity increases to a 
maximum value, whereupon 
it again decreases to zero. 
Subsequently, the wave-like 
nature of the curve continues 
with decreasing amplitude. 
Scattering on such objects 
leads to the formation. of 
diffraction bands and spots 
of various shape depending 
on the nature of the scat- 
tering object. 
_ The most pronounced dif- 
fraction effects are observed 
for scattering at apertures 
Made in an opaque screen. 
Every aperture may be 
Viewed asa region uniformly 
filled with radiating dipoles. 
The scattering patterns of 
an aperture and a particle 
having the shape of such 
an aperture should yield 
identical curves of intensity 
YS. scattering angle. 

For light rays, diffraction 
Patterns are best observed 
USing parallel rays in accord- 
ance with the following 
Scheme, The rays of a light Fig. 154 
eam emanating from a 3 A ` 
Source are made parallel and allowed to impinge on a Sass vee 
Various inclusions (if the screen is transparent) or are (i mie 
Screen is opaque) are located. A lens placed behind the screen im m 
Parallel rays into the plane of a photographic plate on soreèn iir , 
Observation of the effect. If there are no nonuniformities, apertures, 
etc., in the path of the rays, the lens gathers the rays in a point. 
Otherwise, a scattering or diffraction pattern Appeare bi eee 

Fig. 154 shows the diffraction patterns obtained in this manner 
trom (a) two needles and a thin wire and (b) a circular aperture. 
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To make these patterns more understandable, let us determine 
the intensity distribution of scattered radiation* for the simple case 
of an aperture having the shape of a slit. 

Let a wave impinge normally on a slit cut in an opaque screen. 
Considering the slit divided into volume elements AV as shown in 
Fig. 155, let us write the expres- 
sion for the wave emanating 
from an arbitrary volume element 
AV, at an angle ọ to the incident 
wave. The waves from different 
volume elements AV, arrive at 
the point of observation in differ- 
ent phases. If the path difference 
is measured relative to the outer- 
most ray (on the side of deflec- 
tion), the rays emanating from 
the other volume elements will 
cover a distance that is greater 
by the amount æ sin ọ. Hence, 


these rays will be displaced in phase by my sin @. 


The amplitude of the wave scattered by the k-th v 


olume element 
is proportional to the “scattering” volume element AV, i.e., to the 
expression 


ZO AVi 


2 
AV), cos (o -E ir sin p) $ 


It is necessary to take the summ 
volume elements. Instead of t 
with respect to x, where the 
slit. Replacing AV, by Azp, 
over to the limit, we obtain 
at the angle ọ: 


ation of the expressions for all 
aking the summation we may integrate 
x-axis is taken along the width of the 
which is proportional to it, and going 
for the amplitude of the scattered wave 


a 
A=k f cos (o1— = sin o) dz, 
0 


viet k is a coefficient of proportionality and a is the width of 
the slit. 


Introducing the variable 


2 4 
2=0t—* rsin P, 


* “Scattered radiation” and “diffracted radiation” have exactly the same 
physical meaning. The terms “diffraction” and “diffracted” are generally used 
when the scattering pattern has rather pronounced maxima and minima. When 


the nature of the interference pattern is not so evident, we speak of “scatter- 
ing”. 
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we obtain 
dz== = sin g dx 
and, therefore, 
in - [sino sin (o = sing) |. 


ae 
— sing 


A 


. . wa. . . P 
Designating z e by u and performing a trigonometric trans- 
formation, we obtain 


kass 
A=— sin u cos (ot — u). 


Thus, the resulting oscillation at the point of observation has the 


amplitude — sin u, i.e., the observed intensity is 


This is the formula for the intensity distribution as a function of 


the scattering angle. 

In most diffraction experiments, 
of the scattering angle p. The re 
later. Therefore, replacing sin ọ by tan Ẹ 


we are interested in small values 
asons for this will become clear 
and since 


tanp=+ 


Where æ is the distance of the point of observation in the plane of 
the photographic plate to the centre of the diffraction pattern and 
fis the distance from the slit to the plate, we obtain for u the expres- 
sion 

pe 
= Í 


A 
curve. Since w is proportion 


u? 4 
responds to the diffraction pattern appearing On the plate. 
The locations of the dark fringes are easily determined from the 


Condition u = + nm, where n isa whole number. Thus, the first zero 
Occurs at 


al to x, this cor- 


Fig. 156 shows the 


geet: 
a 


This value of x also represents the distance between two successive 


Positions of zero intensity. 
From this formula, we can de 
May be detected for various wav 


termine when diffraction phenomena 
elengths under various conditions. 
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i i = 0.5 > r be clearly observed 
iffraction of light (A = 0.5 x 10-* cm) may i Tw 

EE T on if the aperture is of the order of ey 
and the distance between the screen and plate is of the or cate 
2 metres. Since for these values z = 1 mm, the effect is clee 
visible. i i 
E of visible light yield noticeable diffraction from a tora 
ball (a = 5 cm), but at a greater distance. If the distance f is equa 


to 100 metres and the wavelength % is 5,000 A, then z = 1 mm. 


I 


Fig. 156 


Thus, in this case too the distance between positions of zero intensi- 
ty of the scattered radiation is of the order of magnitude of 4 mm. 
Diffraction of radio waves may also be observed if a proper choice 


PA È s a e 
of conditions, as determined by the equation z aif , is made. 


Assume that the values of f and A are fixed. The width of the slit 
greatly affects the diffraction pattern. If the width of the slit is large, 
«—> 0, i.e., the slit image focussed by the lens is infinitely narrow. 
As the width of the slit is decreased, the diffraction pattern begins 
to take shape and the first diffraction minimum begins to move fur- 
ther and further away from the centre of the pattern. Finally, when 
the slit is made so narrow that our approximation in the formula 
for u (the substitution of sin P by tan @) is no longer justified, the 
image of the slit on the screen becomes smeared. A still further de- 
crease in width, to the point where the wavelength and the width 
of the slit become equal, results in the slit yielding secondary radia- 
tion as a single source. The interference of primary waves disappears 
and the primary wave is radiated from the slit in all directions. 

For apertures and particles (or inclusions) having other shapes, the 
diffraction patterns appear entirely different (see Fig. 154). Never- 
theless, the general laws remain valid and the basic features of the 
pattern are maintained. Thus, for example, in the case of diffraction 
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from a circular aperture or other circular nonuniformity, concentric 


rings are formed and the diameter of the smallest dark ring is 4.2940 F 
where D is the diameter of the aperture. 

Since diffraction patterns have maxima at different locations for 
different wavelengths, white light is resolved into its spectrum upon 
diffraction. Therefore, diffraction from a circular particle or aper- 
ture has the form of a rainbowed ring. 


138. A System of Randomly Distributed Scatterers 


We have considered the behaviour of various secondary radiators 
of electromagnetic waves as a function of the ratio of their dimen- 
sions to the wavelength of the incident wave. But the properties of 
a radiator are only roughly determined by its dimensions. The detailed 
pattern is determined by the distribution of matter in the scat- 
tering particle. Only when the dimensions of a particle are small 
relative to the wavelength is the distribution of matter in the par- 
ticle of no consequence. In this case, the particle scatters as a whole, | 
i.e., as a single electric dipole. When this condition is not satisfied, 
the pattern is complex since it is determined by the interference of 
waves scattered from various volume elements of the particle. We 
have considered only one example of a scatterer whose dimensions 
are larger than a wavelength, namely, a homogeneous scattering 
particle, which may take the form of an aperture in an opaque screen. 

Now, let us consider the problem of scattering by systems of 
particles—e.g., a system of gas molecules, specks of dust or smoke 
particles; a system of hoar-frost crystals on a windowpane; or a sys- 
tem of holes in a piece of gauze. In all such cases, the pattern 
becomes complex due to the fact that the electromagnetic waves 
emanating from the various scatterers may, generally speaking, 
interfere with each other. Now, the scattering pattern will depend 
not only on the properties of a scattering particle, but on the ar- 
rangement of the particles. Thus, it is important to know how close 
the scatterers are to one another and whether their spacing is regu- 
lar or random. Then, depending on the circumstances, the waves 
scattered by the various particles may interfere to a maximum 
extent, partially or not at all. b 

Confining ourselves to the extreme cases, let us first consider scat- 


tering by a system of randomly distributed particles—e.g., the 


Scattering of X-rays by a large cluster of atoms or molecules having 


a random distribution. 

In the case of a large num 
atoms, molecules or larger i 
tering is determined, as we ha’ 


24—1409 


ber of identical scattering centres (e.g., 
dentical particles), the resulting scat- 
ve already stated, by the scattering of 
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-a single centre (region) and the arrangement of the scattering centres- 
The scattering pattern in the case of uniform distribution of scat- 
tering centres is quite different from the pattern in the case of ran- 
dom distribution. 

If the scattering centres are randomly distributed as, for example, 
in the case of gas molecules, the waves scattered by different centres 
may be considered to be noncoherent. This is because in the case 
of randomly distributed scattering centres the phase relationship 
between waves emanating from different centres is quite arbitrary. 
It is safe to say that the number of waves with positive amplitudes 
arriving at a point of observation (from different centres) will be 
exactly equal to the number of waves with negative amplitudes 
arriving there. The result is clear. Designating the amplitudes of 
the waves from the various centres by A,, As, Ag, etc., the total 
amplitude at the point of observation is 

A=Aj+A,+A5-+.... 
sce the intensity is proportional to the amplitude squared, we 
obtain 


DA ARRAS es DAA, 1 DAIA, lo 2AyAgt ... > 


But among the double products there will be just as many positive 
terms as negative terms. Therefore, with a high degree of accuracy, 
the total is given by the sum of the amplitude-squared terms. In 
other words, the total intensity scattered by identical randomly 
distributed centres isex- 
pressed as follows: 
I=NA?, 
where W is the number of 
scattering centres and A is 
the scattering amplitude of 
one centre. 

Thus, it turns out that 
scattering by a large number 
of randomly distributed par- 
ticles is very similar to scat- 
tering by a single particle- 


Film The only difference is that 
the effect is V times greater- 
Fig. 157 Data on scattering by asin- 


gle molecule are obtained by 
investigating the scattering of 
laboratory set-up for the study 
X-rays is made monochromatic 
is directed into a gas chamber- 


X-rays by a gas. Fig. “457 illustrates a 
of X-ray scattering by gases. A beam of 
by reflecting it from a crystal, whereupon it 
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Scattering is determined by its action on a photographic film, i.e., the degree 
of blackening is a measure of the scattering intensity. We can thus obtain the 
intensity as a function of scattering angle—rapidly decreasing smooth curves 
for monatomic gases and decreasing curves having small maxima for polya- 
tomic gases. By means of these curves, and theoretical formulas, the electron- 
density distribution in a molecule may be determined. 

There are many examples of systems that scatter electromagnetic 
waves as a gas scatters X-rays. 

The scattering of light rays in a room with dust is a familiar exam- 
ple. Through a chink in a window curtain, a narrow, rectilinear beam 
of light visible from all sides penetrates a room. Light waves act on 
a system of dust particles in much the same way as X-rays act on 
a system of molecules. The distances between specks of dust are 
quite large and the distribution of the particles is completely random. 
Hence, there is no interference between the waves scattered by 
different dust particles, and the scattering pattern is similar to 
that created by asingle speck of dust. The sole difference liesin greater 
intensity, i.e., the intensity is proportional to the number of dust 
particles in the field of the primary beam of light. Each speck of 
dust behaves as an elementary electric dipole, for the dimensions 
of such a particle are less than the wavelength of light. Therefore, 
the laws applicable to the scattering of light by dust particles, i.e., 
the dependence on the wavelength of light and the nature of angular 
distribution, are the same as for an elementary electric dipole (i.e... 
the intensity formula given on p. 326 and the intensity distribution: 
shown in Fig. 133 are applicable here too). rc wet! 

This agreement can also be easily demonstrated by comparing the 
diffraction pattern from a single aperture with that from a system of 
randomly distributed apertures. Experiments show that, as regards 
the relative intensity distribution of scattered light, these two 
diffraction patterns are entirely alike. Of course, in the case of N 
apertures, the intensity of the light scattered by the screen is V times. 
greater than the intensity of the light scattered by an opaque screen 


having one aperture. 
Since the scattering pattern due to a large number of randomly 
m as that due to a single centre, 


distributed centres has the same for 2 
it becomes clear why a lantern observed through a window covered 


i ili i i ttern is 
with hoar-frost has the familiar rainbowed halo. This pa 

simply the result of diffraction from the ice particles. Soe 
distribution is perfectly random, they behav eas circular” particles, 
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I : icle we shall consider the other extreme case of the 
n this article erfectly homogeneous medium we 


phenomenon of scattering. By a p hon : 
mean a system of scatterers that are distributed uniformly and 
24* 


W a 
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continuously throughout a given region. Transparent glass consti- 
tutes such a medium with respect to light waves. Since the wave- 
length of light is considerably greater than the distance between the 
atoms in glass, a piece of transparent glass may be considered to be 
divided into volume elements that are considerably smaller than 
the wavelength, but at the same time still containing a large number 
of molecules. The medium may be considered homogeneous if the 
number of molecules in all such volume elements are approximately 
the same. 

The scattering of electromagnetic waves is not only a function of 
uniformity in regard to the number of radiators per unit volume but 
is also a function of uniformity in the orientation distribution of 
the radiators. In the final analysis, the scattering power of a body 
is determined by its dipole moment, which is composed of the indi- 
vidual dipole moments of the molecules contained in this volume. 
It may be stated, therefore, that the scattering power of a body is 
determined by the value of the dielectric constant or, since € = n”, 
by its refractive index. Hence, for a given wavelength, a medium 
that is uniform as regards the scattering of electromagnetic waves 
must also be uniform as regards the refractive index. 

Observations on the behaviour of electromagnetic waves in per- 
fectly homogeneous media show that no scattering takes place in 
such media. Thus, when a ray of light passes through a transparent 
body, the ray cannot be seen from the side (compare this with the 
case of a light ray entering a room with dust). 

Each volume element of a homogeneous body is a wavelet source. 
Nevertheless, wave scattering does not occur. Only one explanation 
is possible, namely, the wavelets scattered by a homogeneous medi- 
um in any direction at an angle to the primary ray are completely 
annulled due to interference. This theorem is capable of rigorous 
proof but the proof will be dispensed with since it is quite evident 
that this is the only possible explanation. 

Nevertheless, the phenomenon of scattering does make itself felt 
quite considerably in a homogeneous medium. Scattered waves. annul 
each other in all directions except one, namely, the direction in 
which the primary wave is propagated. Forward scattering does not 
represent simply a superposition on the primary wave but a change 
in its velocity as well. 

It turns out that the phenomenon of electromagnetic wave refrac- 
tion, which we have already discussed, may be interpreted as a natural 
consequence of scattering. 

An electromagnetic wave propagated in a medium may be repre- 
sented as the sum of the primary wave and the scattered waves- 
Theoretical calculations show that the superposition of these waves 
leads to the retardation of the primary wave. 
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A substance uniformly distributed, in the sense that we have just 
discussed, does not scatter electromagnetic waves. Although all 
portions of this substance create wavelets, secondary radiation is 
not observed from the sides: No matter what point in the region we 
chose as our point of observation, it can be rigorously proved that 
the waves scattered by a uniform substance annul each other due to 
interference. Since for any particular wavelet there always corre- 
sponds another that is exactly opposite in phase, the net effect is 
complete cancellation. : 

Now, let us assume that in some limited region the substance has 
a greater density than in the surrounding medium, i.e., that it has 
an excess of dipoles per unit volume. Then, all wavelets are annulled 
except those created by this excess density. As always, the scattered 
radiation is determined by taking the sum of the wavelet amplitudes, 
whereby the phase differences between the wavelets arriving at 
the point of observation must, of course, be taken into account. 

Is the situation any different if the density of the scattering region 
is less, instead of greater, than the surrounding medium? Scattering 
will cease if matter is added to such a scattering region until the 
medium becomes homogeneous. Clearly, the net effect is the same 
if we add a certain amount to a given quantity or subtract this same 
amount. Hence, the scattering from a region of lower density is equal 
to the scattering from the missing substance, i.e., the substance 
required to make the medium homogeneous. t : 

Thus, it is only important that the scattering region have a densi- 
ty distribution of matter that differs from the surrounding medium. 
Moreover, as regards scattering, the effect of a positive, density 
deviation is indistinguishable from a negative density deviation of 
the same magnitude. For example, the scattering from porous glass 
is the same as the scattering from glass containing randomly dis- 
tributed inclusions of exactly the same size as the pores. _ 

Due to the large wavelengths of radio waves, scattering will take 
place in this case only when the nonuniformity of the density occurs 
on a relatively large scale. For example, in order for the scattering 
of kilometre waves to be detectable, the extent of the deviations 
from average density that are intercepted by the waves must be at 
least several hundred metres. The waves are unable to “detect 
Smaller i ions or density gaps: J E 

The a of light waves is detectable when disturbances in 
the distribution of scattering matter are at least of the order of sever- 
al tenths of a micron. Thus, light waves are unable to detect non- 
uniformities in the distribution of electrons in a molecule or in the 
region between two adjacent molecules since these phenomena are 
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restricted to regions whose dimensions are much less than several 
tenths of a micron. The situation is different as regards X-rays. 
In this case, the wavelengths are of the same order of magnitude 
as the dimensions of an atom. Hence, an individual atom appears 
as an “inclusion in a void”. 

The scattering of light waves on nonuniformities is a common 
phenomenon. Nonuniformities in a scattering substance are easily 
recognised by the external appearance of the medium, i.e., by its 
turbid appearance. The conditions needed for light scattering prevail 
in opalescent glass, dust-laden air, etc. In all these cases, there are 
random disturbances of the density of the substance, whereby the 


dimensions of the disturbed regions approach the wavelength of 
light. 


As was indicated above, if a particle or nonuniformity scatters as 
a single dipole, in other words, if its dimensions are no greater than 
one-tenth to one-twentieth of the wavelength, the scattering inten- 
sity as given by the dipole formula (see p. 363) is proportional to 
the frequency, and inversely proportional to the wavelength, taken 
to the fourth power. This is the explanation for the following inter- 


esting phenomenon. When white light is scattered by a medium 
ires a blue coloration since 


t o bid media nonhomogeneous. 
A homogeneous gas or liquid is optically nonhomogeneous due to the 
presence of density fluctuations. This may be shown by calcula- 
tions. Light Waves scattered in a region whose linear dimension is 
0.02 micron, i.e., one-twentieth of the wavelength A, may be con- 
sidered to be in phase. There are, on the average, 215 molecules 
in such a gas volume (8 x 10-18 cm3) un 


l olun der normal conditions. The 
relative fluctuation in the paber of particles according to the laws 
of statistical physics is T rie, approximately 4 per cent. Thisis 
a perfectly perceptib i i i i 

a ae perceptible nonuniformity as the Scattering of light by 


The blue colour of 
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lar structure of the substance rather than its impurities. Investiga- 
tion of the molecular scattering of liquids is of interest as a method 
for the determination of certain features of molecular structure. 
As regards the nature of scattering, a nonhomogeneous medium in 
which the regions of deviation from the average density are suffi- 
ciently far apart and quite randomly distributed does not differ from 
a system of randomly distributed scattering centres (Sec. 138). For 
the most part, however, in discontinuous media (such as fluids and 
amorphous solid bodies in the case of X-rays, opalescent glass and 
colloidal systems in the case of light rays, and the atmosphere in 
the case of radio waves) the interference of waves scattered by neigh- 
bouring regions of lower or higher density affects the form of the 
scattering pattern. This type of interference yields a scattering 
pattern that significantly differs from the ideal scattering pattern 
obtained from a single electric dipole. 

We have considered scattering of electromagnetic waves by 
a system of randomly distributed particles, scattering in a perfectly 
homogeneous medium, and finally, as-an intermediate case, scatter- 
ing in a nonhomogeneous medium. One more important case remains 
to be discussed, namely, scattering of electromagnetic waves by 
systems of uniformly distributed centres. Under this case, the follow- 
ing systems will be considered: a diffraction grating for light waves, 
directional radiators for radio waves and crystals for X-rays. 


141. Diffraction Grating 


A diffraction grating may be constructed using a glass plate coated 
with a thin layer of aluminium. By means of a special device, uni- 
formly spaced lines are inscribed on this plate with a soft ivory tool. 
In such a “grating”, the nonuniformities (lines) are uniformly distrib- 
uted, leading to a number of light scattering peculiarities. 

We shall be speaking about an optical diffraction grating, but the 
discussion applies to any regular distribution of nonuniformities and 
scattering centres and to all electromagnetic waves, i.e., from the 
shortest to the longest (kilometre waves). The discussion will be- 
restricted to diffraction using parallel rays, which may be realised 
as indicated in Sec. 137. 

If all scattering centres are identical, as is undoubtedly the case 
in an optical diffraction grating, the diffraction pattern may be 
determined in the following manner. 

Consider the amplitude of the wave emerging at an angle @ to the 
incident wave. The total amplitude is equal to the sum of the ampli- 
tudes of the waves scattered by the individual centres. If the waves 
from the individual centres arrived at the point of observation in 
phase, the total amplitude would be equal to the product of the 
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number of centres V and the amplitude of an individual centre fs 
However, the wave from each centre is displaced in phase relative 
to the wave from an adjacent centre, and it may be assumed that the 
magnitude of the displacement is the same in each case. Waves from 
different centres interfere with each other and the resultant intensity 
is equal to Lf? rather than Nf?, where L is a quantity greater than V 
for the directions in which the waves reinforce each other and less 
than WV for the directions in which they arrive, in the main, out of 
phase and annul each other. 

The directions for which the waves from all centres reinforce each 
other may he easily determined with the aid of Fig. 158. The path 


Fig. 158 


difference between waves emanating from corresponding points of 


adjacent centres is equal to asin ọ. If this path difference is equal 
to a whole number of wavelengths, the waves reinforce each other: 
a sin ọ = nÀ (maximum condition). As may be seen, there are 
several such directions. If the wave impinging on the grating is not 
monochromatic, the grating resolves the wave into its spectrum. 
Moreover, there will be several spectra rather than just a single 


spectrum. The number n appearing in the aboye equation is called, 
therefore, the order of the spectrum. 


Since it is somewhat cumbe: 


, s grating consisting of a large number 
of scattering centres (lines) is 


l ine .of considerable practical importance. 
Imagine the grating divided into two parts. Now, compare a pair 


N 

>+1) st centre, the 
aximum reinforcement 
1 pairs of rays is equal 
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to Z na. If the path of the rays is slightly changed, the rays being 


inclined so that the path difference increases by Es maximum 


reinforcement of the superimposed waves is replaced by total annul- 
ment. The first wave cancels the (3 F 1) st, the second cancels 


the z +2) nd, etc. Exact calculations show that for positions 


further away from the maximum the intensity remains practically 
equal to zero until the angle of inclination @ approaches the position 


of the next maximum. 2 f , 
The angle at which maximum diffraction of n-th order occurs is 


S = nh 
given by the formula sing = 7 - 

If we designate the half-width angle of the maximum by Ag, then 
for the angle (p + Ag) we may write the condition 


a 
N asin(p HA = p tg: 


Hence, R 
n. 


sin(p +49) == t Wa 
or 
. A 
sin (p+ Ag) — sin 9 = Fa 


The distance between two successive maxima is determined by the 


expression x 


sin po — Sin Q= 7: 


-width of a line is, roughly speaking, fe of the 
en N is large, i.e., in the case of grat- 
ber of scattering opa eS ID diffrac- 
ti i r mely narrow and the resolution in the spectrum 
obtain A is very fine. Imagine, for example, that 
light containing two close waves, Aà and A+ peach oi a grat- 
ing. For simplicity, assume we are concerned wit w tering at 
angles less than 20°; hence, sin @ ~ Ẹ- Then, in the wn order iee 
two lines are displaced by the angle 6p, which, as can be seang rom 
is approximately equal to — 6a. 


ch wave may be determined from 


We see that the half 


distance between maxima. Wh 
ings consisting of a large num 


n 
a 


the condition ọ œ% sin 9 = 
The width of the maximum for ea 
the equation 1 

sin (p+ 59) —sin p © ôP = Fa - 
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Evidently, these two lines may be distinguished (in optics, we 
say resolved) if 
Zuo 


Na ` 
The expression x= nN is an indicator of the resolving power of 
the grating. 


Example. In a good diffraction grating, the distance a between lines is 
a~10-3mm and the number of lines Ñ is equal to 100,000. Then, the resolving 


power for the spectrum of second order is + = nN = 200,000. This means 


that for A= 6,000 A, for example, two lines whose wavelength difference is 
0.03 A may be resolved. 


Let us consider the intensity of a diffracted beam. The waves 
directed towards a maximum point act in phase. If f is the amplitude 
of the wave scattered by one centre, the total amplitude in the maxi- 
mum direction is Vf and the intensity is V°f?. Thus, the height of 
a diffraction maximum is proportional to the number of scattering 
centres squared and, since the width of a maximum is inversely pro- 
portional to N, its area (i.e., the integral of the maximum intensity) 
is proportional to N taken to the first power. If different maxima are 
compared, it will be seen that the ratio of their heights (or, what 
amounts to the same, their areas) depends on the value for these 
directions of the amplitude f of the scattering from one centre. 

Thus, the period of a grating determines the locations of the maxi- 
ma, while the form (in the broad sense of the word) of a line or scat- 
tering centre determines the intensity of the maxima. 

Let us assume the angles Q4, 
period of a grating. Scattered rays occur onl 
what will be the intensity of these ra 


angle Ps, the amplitude f is close to zero 
will not appear in the diffraction 
in Fig. 159, which shows the diffraction S 
tors f of one centre (dotted curves) f 
ments. : 

These principles form the basis of any study of structure by means 
of diffraction spectra. The distan lines enables 
us to determine the period of the grating—assuming, of course, that 
the wavelength is known—and the intensity ; 


y of lines of different order 
enables us to determine the structure of a scattering centre. 
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Example. Consider a diffraction grating for which a = 3 Xx 107ë mm and 


N = 1,000. Assume a parallel monochromatic beam of light (A = 5,000 A) 


impinges on this grating. Diffraction maxima are visible at the angles given by 
sin m= =e , and the width of a diffraction maximum is equal to 


A 1 r A 
Wa 3.000 re vali r: tr 
2 Wa = 3,000 ` The obtained results are valid for any scattering centre 


form. To calculate the relative intensity of the diffraction maxima, it is 
necessary to consider specific scattering centres. Let us examine two cases: 

1. The scattering centres are single strips of width bat a=0,75 x 
10-3 mm (Fig. 159a). In Sec. 137, we obtained a formula for the inten- 
sity of a wave diffracted at a slit. The magnitude of the intensity is pro- 
pon onal to the amplitude squared of the scattering from one strip: f? ~ 
~ Snt » where u="; sing. The intensity of the n-th diffraction maxi- 


mum is determined by the magnitude of fùn in the direction Pn, which is 


determined, in turn, from the equation sin m= . If fj is taken as 100, 
the other intensities have the following values: 


Ji=80; f{ł=40; f§=8.9; 3=0; f2=3.2. 


2. The scattering centres are double {strips of width baz a=0.75X 


X10-3 mm each. The grating period a is the same as before (Fig. 159b). 
Clearly, the location and width of the diffraction maxima have not changed. 
Calculations similar to those performed in Sec. 137 show that for two 


slits 
7p sin2u 201 tog 
~ ae 1-++cos [= (e+e) singe |} ` 


From this expression, the relative intensities of the diffraction maxima are 
easily determined. Assuming, as before, that fè = 100, then 


31=12; ff=20; /2=7.5; R=0; f2=2.7. 


142. Directed Radiators of Radio Waves 
__ In certain radio engineering applications, particularly radar, 
it is required to direct a radio beam into space in such a manner 
that the transmitted energy is concentrated in a very small solid 


angle. One solution of this problem is to utilise a linear array of 
antennas. 


In Sec. 141, we saw that for uniform Spacing of scattering centres 
the radiated energy is concentrated in specific directions. If radia- 


tors of radio waves are arranged in a single row (see Fig. 160) with 
a distance a between adjacent antennas, and if all the antennas are 
fed synchronously, such a radiator array will in no way differ from 
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a scattering diffraction grating. The fact that we are dealing here 
with primary waves in no way detracts from the applicability of 
the discussion in the preceding article to the present case. It is only 
necessary that it be permissible to consider the radiation from differ- 
ent antennas coherent, which is the case if the antennas are fed 
synchronously from a generator of oscillations. 

IE the antenna array is so dense that the distance between adjacent 
dipoles is less than a wavelength, even first order diffraction is 


Fig. 160 Fig. 161 


impossible, as the equation a sin ọ = nì shows. Only the zero order 
remains. This means that there are only two radiation maxima— 
one forms a 0° angle with the normal to the array and the other 
a 180° angle. The point made in Sec. 141 about the width of the 
maximum is valid here too, namely, the larger the total number 
of radiators, the smaller the solid angle in which the beam intensity 
reaches an appreciable value. t 

However, in practice, it is inconvenient to have radiation of equal 
intensity in two opposite directions. To avoid this, double arrays 
of dipoles are used (see Fig. 161). The antennas of each dipole pair 
are separated by a distance of 1/4 and a phase difference of 90° exists 
between the two currents. As a result, one of the two maxima is 
annulled and all the energy is concentrated into one diffraction 
maximum. The phase relationships existing for each dipole pair in 
such an array may be explained as follows. For the “forward” waves: 
if the waves were propagated synchronously, the path difference 
between them would be 4/4; but the antennas do not operate synchro- 
nously, i.e., the wave radiated by the “front” dipole lags by 90°, 
which compensates for its lead due to the path difference. The situa- 
tion is different for waves propagated “backwards”. The displace- 
ments due to the path difference and the phase difference between 
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the currents in the antennas are in the same direction. Hence, the 
total phase difference is 180°. As a result, there is no radiation in the 
backward direction. 


Fig. 162 


If the antennas are arranged in a row, a narrow beam may be 
obtained in the plane perpendicular to the antennas. But if a small 
beam in space is desired, a more complex system of oscillator ele- 
ments must be used. That is why radar antennas have such strange 
appearances (see Fig. 162). 


CHAPTER XXII re 


DIFFRACTION OF X-RAYS BY CRYSTALS* 


143. Crystals as Diffraction Gratings 


As has been already indicated, a diffraction grating usually con- 
sists of a piece of glass on which equally spaced lines are scratched. 
What is essential here to obtain one of the typical diffraction patterns 
discussed above? Is it the presence of glass, the shape of the lines, 
the thickness of the glass, or the width of the “slits”? A careful study 
of Sec. 144 shows that the essential element is the periodic repetition 
of the nonuniformities of the scattering substance. Thus, irrespective 
of the cause of the scattering and the nature of the nonuniformities, 
as long as these nonuniformities are repeated with a periodicity a, 
scattering maxima will occur at angles @ satisfying the equation 
a sin p = md. Such a pattern is given by lines of any shape scratched 
on any glass, or slits of any shape made in any screen. It is only 
necessary that the distribution of matter repeat with a periodicity a. 

To be sure, certain differences in patterns may exist. The inten- 
sities of rays diffracted in different orders may differ depending 
on the shape of the slit. The distribution of substance within a repeat- 
ing nonuniformity affects the scattering intensity f*, which for 
different orders may have different values. 

Now that we have reviewed the results of Sec. 144, let us turn 
to crystals. The ‘basic feature of crystals distinguishing them from 
other bodies is the periodic distribution of matter. Along any 
direction of a crystal, the time-averaged electron density is periodi- 
cally repeated. In the simplest case, the electron density distribu- 
tion will appear as shown in Fig. 163. This figure shows the electron 
density (the number of electrons per cubic Angstrom) along a line 
parallel to the edge of a cube of rock salt. An electron density max- 
imum corresponds to the centre of an atom. The large maximum 
corresponds to the centre of a chlorine atom and the small maximum 
to that of a sodium atom. The pattern repeats itself after every oth- 
er atom, and the period of the electron density distribution along 
the line is equal to 5.6 A. This is the pattern of the distribution 
obtained along a specific line. Along a slightly displaced parallel 


line, the distribution will be different. 


ed 
* Before reading Chapters XXII and XXIII, it is recommended that Chap- 


ter XXXII be perused. 
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However, a crystal is a three-dimensional formation, and the 
repeating element is a three-dimensional cell. The electron density 
distribution of a cell cannot be illustrated graphically, but it is 
sufficient to know that the cells repeat in space. The similarities 
and differences between a crystal and a diffraction grating are evl- 
dent. A crystal is a three-dimensional diffraction grating in which 
the nonhomogeneous element repeats regularly in three-dimensions 


Flectron density 


Fig. 163 


Ae 


rather than along a line. The role of “slit”, i.e., repeating nonuni- 
formity, is played by the unit cell of the crystal. 

Let us determine the nature of the diffraction pattern created 
by a crystal. v 

X-rays are scattered by electrons. The nonuniformities in the 
electron density are of such a nature that wavelengths of the order 
ofj41-2 A yield perceptible diffraction. In order to determine the 
direction of the diffracted rays, the wavelets coming from all the 
cells must be added. Of course, the amplitudes of these wavelets 
for a given direction are the same. The difficulty arises in taking 
account of the phase differences between wavelets scattered by 
individual cells. These wavelets must be added for every direction 
and the directions in which the wavelets reinforce each other to 
a maximum extent determined. 

The problem can be solved in various ways since various summation 
sequences are possible. For example, first wavelets from the 
cells along edge a may be added, then wavelets from all columns 
in the plane ab, etc. But we shall use a much simpler method. This 
is the method proposed by the founders of X-ray structural analysis, 
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the two Braggs—father and son, British scientists. (The same idea 
was proposed independently by the Russian erystallographer Vulf.) 
In a crystal, parallel planes can always be passed through the lattice 
nodes in numerous ways; the crystal is composed of the layers between 
such successive planes. Let us construct a normal to these layers” 
and imagine the electron density projected onto the direction of the 
normal. Clearly, a periodic 
electron density distribution 
exists along the normal. 
The period d is appropriate- 
ly called the interplanar 
distance. 

The condition for maxi- 
mum reinforcement of waves 
scattered by the cells of one 
layer is that the angle of 
incidence equals the angle 
ofreflection, Thisconclusion, =r Sp "S| os lle oe 
is based on Huygen’s 
principle, for only when the 
above condition is satisfied 


are the scattered waves d ; 
propagated in phase, and thus reinforced. Waves of successive layers 


reinforce each other when certain additional conditions are met. 
Fig. 164 shows that the path difference between rays reflected 
from the corresponding elements of two adjacent layers is equal 
to 2d sin 0. Thus, diffracted rays are obtained when the condition 


QIdsin0=nd 


Fig. 164 


is satisfied. 
A diffracted beam is obtained when a system of planes may be 


found, among the countless such systems into which the crystal may 


be divided, which satisfies the condition 2d sin 0 = nÀ. Of course, 
this requirement may si ; 


planes. However, the more likely situation is that diffraction does 


not occur for an arbitrary direction i 
Hence, in order to observe diffraction, the crystal must be turned 
’ 


until a suitable angle 9 is found 


i i e in a calcite crystal is equal to 3.029 A. 
Fear ae interea ON from a copper anode is often used. 


In X-ray structural analysis, L l > ; 
Since ap radiation has a wavelength of 1.54 A the diffraction maximum of 
À ms 14°40". 


first order occurs at 0 = arcsin Dat 


25-1409 


= 
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amplitude of a cell is expressed by 
a 


F= f p cos (ot + @) dz. 
0 


The integral is taken over a single period (the interplanar distance); 
the values of p and @ for each z differ. 
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We shall not concern ourselves with the problem of determining 
the electron density of a crystal at all points of a unit cell. In all 
cases the basis of the solution is the above equation. It enables us 
to calculate the amplitude of any diffracted beam if the electron 
density of the crystals is known. Of interest, however, is the 
converse problem—the determination of the electron distribution in 
a crystal from the experimentally determined intensities of diffract- 
ed beams. This problem has been solved for very complex crystals 
containing hundreds of atoms per unit cell. 

In the case of anthracene crystals, the intensity of about 600 
diffracted beams has been measured. Using these data, the values 
of the electron density at all points of such a cell were determined. 
Fig. 167 illustrates the electron density for a cross-section through 
the centres of atoms of an anthracene molecule. (The method adopt- 
ed to indicate the electron density is that used by topographers to 
indicate elevation. Electron density contour lines on the electron 
density diagram correspond to elevation contour lines on a topo- 
graphical map.) The fourteen distinct peaks represent fourteen hydro- 
gen atoms. Experiments show that the distance between adjacent 
peaks is 1.4 A. The height of an electron density maximum is pro- 
portional to the number of electrons in the atom. Carbon atoms 
have six electrons each and hydrogen atoms one. Not all of the ten 
peaks corresponding to the centres of the ten hydrogen atoms con- 
tained in an anthracene molecule are fully sketched in the diagram. 


The chemical formula of anthracene is C1,H10- 
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The measurement of the angles 20 formed between the diffracted 
beams and the beam incident on a crystal, as well as the intensities 
of the diffracted beams, may be accomplished using an ionisation 
chamber (see p. 532) or a photographic method. A photographic 
film on which the traces of many diffracted beams are simultaneously 
recorded is called a roentgenogram. 

But how can the traces of several beams be obtained on a single 
film when, as has been already indicated, the condition nà =2d sin 
in all likelihood will not be fulfilled even once for an arbitrary orien- 
tation of the crystal relative to the beam? This can be accomplished 


in three ways: i ’ 
1) by rotating the crystal, thereby setting various systems of 


planes in a reflecting position; Í 

2) by illuminating the crystal with a continuous spectrum of 
wavelengths in a band sufficiently broad so that almost every system 
of planes finds a “suitable” wavelength in the spectrum; and 
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3) by obtaining a roentgenogram of a powder, since in a powder 
the diffraction condition for any d is always fulfilled for some crystals. 
The first method is known as the crystal rotation method, the 
second as the Laue method, after the German physicist whose name 
is associated with the discovery of beam diffraction, and the third 
as the powder or Debye method, after the scientist who proposed 
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this method. The Laue method has extremely limited application. 
In practice, the rotation method is used for the investigation of crys- 
tal structure, i.e., for the determination of arrangement of atoms 
and the Debye method is used for special problems arising in connec- 
tion with the investigation of fine crystalline substances. 

The purpose of a rotating roentgenogram is to gather data on 
one film regarding existing interplanar distances and intensities 
of the respective beams. However, it is also necessary to know the 
orientation of the system of planes relative to the crystal axes. This 
requires knowledge not only of the location of a particular spot 
on the film, but also of the instantaneous orientation of the crystal 
when it arose. In order for the roentgenogram to provide this infor- 
-mation as well, the film is displaced during filming. Such filming 
methods are known as roentgengoniometric methods. 

Fine crystalline substances are used much more often than mon- 
ocrystals for the investigation of crystal structure by means of 


X-rays. Fig. 168 illustrates how a roentgenogram is obtained by 
the Debye or powder method. : 
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Let us direct our attention to a specific system of planes sepa- 
rated from one another by a distance d and having a normal 7. 
Assume that the normals n of the crystals are directed in all direc- 
tions. A beam “reflected” from a plane is not produced by reflection 
from all the crystals, but only from those whose planes are at an 
angle Ù to the beam, where ® satisfies the condition 2d sin Ò = ni. 
Accordingly, the normals of these crystals are at an angle of 90°-0 
to the primary beam. All crystals whose normals lie on the cone 
having an apex angle 2 (90°-0) are in a reflecting position for planes 
separated from one another by the distance d. Therefore, the “reflect- 
ed” beams also form a cone, and this cone has an apex angle of 40. 
Ring patterns are obtained when these cones are intersected by a 
photographic film or plate. 

If we are concerned with values of ® not exceeding 20-25°, the 
roentgenogram may be taken on a flat plate. In this case, we obtain 
a system of concentric rings. However, if information on all inter- 
planar distances is desired, and scattering over the entire interval 
of angles possible is to be analysed, cameras with cylindrical film 
are used. In this case, the height of the film is reduced, so that only 
parts of the rings are photographed. 

If for some problem or other it is important to ascertain the inter- 
planar distances more accurately, we usually resort to “pear” film- 
ing, i.e., an arrangement whereby diffraction cones having apex 
angles close to 360° are filmed. 

In determining the scattering angle 0 from the film, a measure- 
ment error AÙ inevitably arises. Let us see how the magnitude of 
this error is reflected in the determination of the magnitude of the 
interplanar distance. ‘After differentiating the diffraction condition 


2d sin Ò = nd, we obtain 
| AF] =cotoao. 


It is seen that the accuracy in measuring the interplanar distance 


rapidly increases as the angle @ approaches 90°. The diffraction 
angle may be easily measured with an accuracy of the order of 0.1°. 
Hence, the above equation shows that along lines for which the 
angle Ù is equal to 65°, 75° and 85° the interplanar distance may be 


measured with an accuracy of 0.13%, 0.08% and 0.05%, respective- 
ly. Using special cameras, the method of rear filming yields very 
good results, enabling interplanar distances to be determined with 
an accuracy of 0.00001 A. For best results, the radiation wavelength 
is selected so that the scattering angle approaches J Banh 

All three types of Debye filming are widely used in the investi- 
gation of the structure of matter. Every substance yields a specific 
system of lines that is characteristic only of it. A phase transforma- 
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cules is not necessarily manifested. Cubic crystals possess isotropic 
polarisability. Here, the equation P = a remains valid, just as 
for isotropic bodies. To make this clear, let us consider molecules 
whose electrons may be displaced only in a single direction. The 
symmetry of a cubic crystal is such that molecules whose axes of 
polarisability form right angles may always be found in the crystal. 
Let us consider three such molecules whose axes of polarisability 
lie along the x, y, z axes of a Cartesian system of coordinates. When 
an arbitrarily directed field # is 
applied, these molecules become 
polarised and produce dipoles of 
moments BE cos a, PE cos Ge 
and BE cos œs, since the moment is 
proportional to the projection 
of E on the polarisation direction. 
The resultant of these three dipole 
moments is obtained by vector 
addition. But a, œs, œa are the 
angles formed by the vector Æ with 
the coordinate axes. Hence, the mag- 
nitude of the resultant moment is 
EV B? (cos? a cos? a, + cos? a3) =PpE 
and the direction is parallel to Æ. 
Taking the summation for all the 
molecules, we arrive at the same conclusion as regards the pola- 
risation vector P and the polarisability « per unit volume. 

Now, let us consider crystals having one main axis, i.e., crystals 
with tetragonal and hexagonal syngony. To be specific, let us 
consider the former, i.e., crystals in which each molecule has three 
identical molecules related to it via an axis of symmetry of the 
fourth order. Assume, moreover—again for purposes of simplicity — 
that the molecule can be polarised only along one axis. Let us direct 
our attention to the four molecules whose axes of polarisability 
form an angle e with the main axis (see Fig. 169). How do these 
molecules behave in electric fields of various directions? If the vec- 
tor ŒE is directed along the main axis, the polarisation is propor- 
tional to cos e. Moreover, in view of the symmetry of the arrange- 
ment, the resultant dipole moment of these molecules is parallel 
to E; hence, the polarisation vector is also parallel to H: 


P=aB, 
where œ is the polarisability created by all the molecules for this 
direction of the field. 


In the projection onto the plane perpendicular to the main axis, 
the polarisability axes form right angles with each other. Therefore, 


Fig. 169 


147. Anisotropic Polarisability 395 


the result obtained for a cubic crystal is valid here, namely, the 
polarisability is the same for all directions of Æ in the plane perpen- 
dicular to the main axis. If Æ is perpendicular to the main axis, 
the polarisation vector P is again parallel to B: 


JAA LTE 


While the polarisability of the molecules along the axis is pro- 
portional to cos €, the polarisability of the molecules in the direc- 
tion perpendicular to the axis is proportional to sine. This means 
that œj and a, are different. 

What is the situation when the field Æ is inclined to the main 
axis of the crystal? In view of the difference in the polarisabilities 


Fig. 170 


inci ith the direction 
a and æ}, the vector P can no longer coincide wi 
or the field, and the value of œ will also be different. Kenovank Cty) 
and œ}, œ may be calculated for any direction. We shall not expui 
how this is done in the general case, but merely cite a numerica 


example. ; - K. X 

Fora crystal of Iceland spar (calcite), «~y = 0.139 and fal = oe 
Assume that the vector Æ forms a 30° angle with the plane perron 
dicular to the main axis and that it is directed as aroy in Fig. 170. 
The polarisation vector in the indicated plane is then 


P, =a, E, =0.139 x Ecos 30° = 0.120 Z. 


The polarisation vector perpendicular to this and parallel to the 
Main axis is 7 
Pi = ony Ey) = 0.095 x E sin 30 = 0.0475 E. 


Therefore, the angle between the resultant polarisation vector P 
, 


0.0475 <s 94°40! is means thas the angle 
and the plane is arctan 755 ~ 24 40’. This m 
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between P and E is ~ 8°20’. The magnitude of the polarisation 
vector is P = VP} + Pj, = 0.129 E, i.e., in this case the polari- 
sability œ is equal to 0.129. 

If E is directed at a 45° angle to the main axis, the polarisability 
œ decreases even more relative to ~y and becomes equal to 0.120. 
The angle between Æ and P is then 


0.095 
0.139 


The direction of the vector # in Fig. 170 is such that the angles 
formed by this vector with the polarisability axes of the molecules 
differ. Thus, the angles are greater for the pair of molecules on the 
left. As a result, the right pair of molecules is polarised to a 
greater extent. 

The fact that the polarisability of a crystal possesses an axis 
of symmetry of the fourth order does not signify symmetry in the 
contributions of individual molecules to the magnitude of the 
polarisability for an arbitrary field direction. 

Thus, the values of the polarisabilities differ for different direc- 
tions. This has important consequences: The polarisability is unique- 
ly related to the dielectric constant; but e determines the index 
of refraction (see Sec. 125; e = n?) and hence the wave propagation 
velocity in the crystal; as a result, the electromagnetic wave is 
propagated in the crystal with different velocities in different 
directions. 

Tetragonal and hexagonal crystals (in optics they go under. the 
heading “uniaxial”) possess the following characteristic: All orien- 
tations obtained by successive rotation about the main axis are 
optically equivalent. Crystals having a lower order of symmetry 
do not possess this feature. 

Uniaxial crystals have a main direction and, perpendicular to 
it, a main plane. If the vector E points in this direction or lies in 
this plane, then P || E (and, therefore, D || Æ). Analysis shows 
that in other crystals only three main mutually perpendicular direc- 
tions in which P || E may be distinguished. s 


45° — arctan zæ 10°30’. 
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Division of a Light Field into Two Waves. We shall restrict 
ourselves to the study of phenomena occurring when light is inci- 
dent on the face of a crystal cut in two different ways, namely, per- 
pendicular to the main axis and parallel to the main axis. 

Light propagated along the main axis differs in no way from 
light propagated in isotropic bodies. The electric vector produces 
polarised oscillations of the dipoles in the direction perpendicular 
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to the main axis. Hence, the wave is propagated with the velocity 
vo = A where no =} E1; &, is the dielectric constant for the 


direction perpendicular to the axis. The designations no and vo indi- 
cate that the index of refraction and the velocity of light are 
“ordinary”. 

Recalling that e=1+41a, we find that for Iceland spar, m= 
= VIF än, = 1.658. Hence, vp=1.81 x 10!0 cm/sec. 


The passage of light through such a crystal in the direction of 
the main axis does not change its polarised state. Natural light 
remains natural, and the oscillation direction of the electric vector 
for a polarised wave does not change. 

The simplicity of the case considered is characteristic of a uni- 
axial crystal. Here, any polarised state of an incident wave is capable 
of exciting oscillations in the direction perpendicular to the main 
axis. Hence, the polarisability of the molecules (also e and 7) is 
the same for all such oscillations. 

Now, let us consider the case of normal beam incidence on the 
face parallel to the main axis. 

Different polarised waves behave differently. Consider the behay- 
iour of a linearly polarised beam. If the electric vector is perpendic- 
ular to the axis, the light is propagated with the same velocity vo 
as in the preceding case. But if the electric vector is parallel to 
the axis, the polarisation of the dipoles occurs along an axis for 
which the dielectric constant has another value, namely, £q. There- 
fore, for this propagation direction, the velocity and the index of 


7 G Bes 
refraction have other values, namely, Ve = zy and Ne = Ve, 


e 
respectively. The designations ne and Ve indicate that the index of 
refraction and the velocity of light are “extraordinary”. The reasons 
for the above designations will become evident below. 
Crystals for which ve < Vo are called optically positive; on the 
other hand, those for which ve > Vo are called optically negative. 


For Iceland spar, ne= V1 Hána 5 =1.486 and ve=2.02x1010 cm/sec. 
Iceland spar is an optically negative crystal since ve > Vo. 


Elliptical Polarisation. What is the situation when the electric 
vector Æ of the wave incident on the face of the crystal forms an 
angle @ with the direction of the main axis (Fig. 174)? Experi- 
ments show, and this may be predicted from Maxwell’s equations, 
that the electromagnetic wave becomes divided into two parts. The 
vector Æ must be resolved into the components Z sin p and Æ cos ọ. 
The first corresponds to a wave travelling with the velocity vo, and 
the second to a wave travelling with the velocity ve. This may be 
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shown by determining the path difference between the two waves. 


that are created upon- division of the incident wave. Designating 
the thickness of the crystal by J, the phase difference can be ex- 


pressed as ô= a (ne—no). 


Thus, the polarised state of the wave leaving the crystal has 
changed significantly. The wave incident on the crystal was linearly 
polarised, while the one leaving is a combination of two waves 
having mutually perpendicular oscillation directions and displaced 
relative to each other by 6. What is the nature of this peculiar po- 
larised state? Such light is said to be 
elliptically polarised since the terminus 
of the electric vector describes an ellip- 
tical spiral. If the electric vector of one 
of the waves is given by 


E= Esin @cosat, 
then for the other wave the electromag- 


netic oscillation in the plane perpen- 
dicular to the beam will have the form 


E= E cos q cos (ot + 6). 


The addition of such oscillations has 
Fig: 174 been considered earlier (see p. 108). It 
E- was seen that the point describing two 
such oscillations is an ellipse. The same applies to the terminus of 
the electric vector, but, since the wave is advancing, the terminus 
of the vector Æ describes an ellipse in its projection on the plane 
perpendicular to the beam. In space, the terminus of vector E 
describes an elliptical spiral winding about the beam axis. 

To obtain circularly polarised light by this method, a “quarter- 
wave plate” is used. This is a plate producing a path difference of 
4/4 between waves travelling with velocities Vo and ve. The thick- 
ness of such a plate must satisfy the equation 

2. 
= (no—ne) = ++ mi. 
If a linearly polarised beam impinges on such a plate so that the vec- 
tor # forms a 45° angle with the direction of the main axis of the 
crystal, resolution of this vector yields 


2 
= 
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But this is the equation of a circle. Thus, the described experimen- 
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tal conditions lead to the transformation of linearly polarised light 
into circularly polarised light. - 

Double Refraction. The apparent bifurcation of objects viewed 
through a transparent crystal is a phenomenon that has been well 
known for a long time. It shows that division into two waves may 
occur not only as regards the propagation velocities, but also as 
regards the beam directions in space. Double refraction occurs for 
normal light incidence on a crystal face (ground or naturally 
formed), which is at an angle to the optic axis. The phenomenon may 
also be investigated by means of a plate cut parallel to the axis. 
In this case, the light must be incident at an angle to the normal. 
We shall direct our attention to the latter case. Let us introduce 
another restriction, namely, that the beam be directed in such ~ 
a manner that the plane of incidence of the light is perpendicular 
to the optic axis. 

Assume that a polarised beam impinges on the plate at an angle 
i. By turning the beam about its axis, the position of the electric 
vector relative to the plane of incidence is changed. 

When the electric vector coincides with the incident plane (see 
Fig. 472a), no special effects are noted. Refraction occurs in ac- 
cordance with the law for isotropic bodies, namely, ras — ys 

The refractive index turns out to be no. This is as it should be, 
for the electric vector is perpendicular to the main axis of the crys- 
tal. When the beam is turned 90° about its axis (see Fig. 172b), 
it is also refracted. But now Se = Ne, i.e., the refraction angle 
is different and the index of refraction is that for an extraordinary 
beam. This too is natural, for the vector Æ coincides with the direc- 
tion of the main axis. 

Most remarkable is that an intermediate position does not yield 
a beam with an intermediate angle of refraction, but yields rather 
two beams—an ordinary and an extraordinary beam having refrac- 
tive indexes mp and ne, respectively. As before, the field intensity 
Vector is resolved into two vectors, one lying along the main axis. 
and the other perpendicular to it. Each component creates its own 
field, or wave. In turning the beam of light about its axis, the inten- 
sities of these two beams continuously change; when one beam 
decreases in intensity, the other increases. 

Since the beams are refracted twice, i.e., in entering and leaving 
the plate, the ordinary and extraordinary beams emerge parallel 
to each other. The thicker the plate, the greater the separation 
between beams. If a narrow beam of incident light is used, the 
difference between the refractive indexes may be determined by 


measuring the beam displacements. 
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Now we can explain why the beams are called “ordinary” and 
“extraordinary”. Let us begin turning a crystal plate, whose optic 
axis is parallel to the face, about the normal to the reflecting face. 
If we were dealing with an isotropic body, such rotation could not 
affect reflection and refraction. When we rotate the crystal plate as 
indicated, nothing happens to one beam, i.e., its position in space 
and its intensity remain unchanged. That is how an ordinary beam 
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Fig. 172 


behaves. It is understandable, therefore, that the beam whose elec- 
tric vector is perpendicular to the main axis of the crystal is called 
ordinary. In this experiment, the electric vector component lying 
in the plane of incidence is always perpendicular to the main axis 
of the crystal. This component acts in an “ordinary” manner. On 
the other hand, the component of Æ perpendicular to the plane 
of incidence forms an angle with the main axis of the crystal that 
varies as the crystal is rotated. During such rotation, not only 
does the extraordinary beam’s intensity vary, but its position 
in space varies as well. We see that the extraordinary beam does 
not obey the laws pertaining to isotropic bodies. In the general 
case, the refracted beam is not in the plane of incidence. 
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We shall not go into the rather complex explanation for these 
phenomena. It should be noted however, that these phenomena are 
in complete accord with Maxwell’s electromagnetic field theory. 


149. Polarisers. Investigation of the Polarised 
State of Light 


It was stated on p. 336 that a dielectric placed with its plane 
surface at an angle py to the incident beam may serve asa polaris- 
er. In this case, the reflected beam is completely polarised and 
the refracted beam is polarised to the maximum extent possible. 


fo 


Fig. 173 


t convenient to use a reflecting plate as a polaris- 
am travels at an angle to the incident beam. 
ad. By repeated refraction, 


However, it is no 
er, for the polarised be : 
A stack of glass plates may be used inste 


Fig. 174 


.an almost completely polarised beam may be obtained. However, 
a considerable portion of the light is absorbed in such a device 


(see Fig. 173). ie 
The best kind of polariser is a crystal in which the linearly polar- 
ay be separated out by 


ised ordinary (or extraordinary) beam m è 
means of double refraction. Such polarisers are known as Nicol 


prisms, or simply Nicols. EN y ; 
The polariser proposed by the French scientist. Nicol consists 


of two right-angled prisms made of Iceland spar (Fig. 174). These 
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prisms are glued together with Canada balsam, a substance whose 
refractive index n is 1,550. This value lies between no and ne for 
Iceland spar. A nonpolarised beam of light impinging on the prism 
is divided into two components. The ordinary beam is reflected 
at the boundary between the prisms, where the condition for its 
total reflection is satisfied, and is deflected to one side. The extra- 
ordinary beam passes through both prisms. Thus, a Nicol acts as 
a slit passing only electric vector oscillations directed in a specific 
direction. 4 

Pleochroism is of great practical importance in polariser appli- 
cations. This term is used to indicate that the ordinary and extra- 
ordinary beams are absorbed differently and that the absorption 
coefficient of the extraordinary beam is a function of the direction 
relative to the optic axis. A pleochromatic crystal gives a different 
hue and absorbs light differently when it ïs turned relative to 
the beam. 

Tourmaline is a classical example of a pleochromatic crystal. 
The absorption coefficient for the ordinary beam over almost the 
entire visible spectrum is so great that a tourmaline plate of 1 mm 
thickness, cut parallel to the optic axis, transmits in effect only the 
extraordinary beam and, hence, may serve as a polariser. However, 
the yellowish-green hue of the transmitted light prevents tourma- 
line from being used in practice as a polariser. 

Polaroids, synthetic pleochromatic films, have wide applica- 
tion and may be prepared from herapathite (quinine sulphate perio- 
dide), a strongly pleochromatic substance. A polaroid is a trans- 
parent plastic film consisting of submicroscopic crystalline hera- 
pathite needles oriented in a single direction. To orient the crystals, 
a viscous mass of the crystals is placed between two glass plates 


are essential to secure 
pleochromatic properties. Pure iodine polaroids may be produced 


e second the analyser 
(see Fig. 175). If a beam of natural light of intensity Jo impinges on 
the polariser, a linearly polarised light of intensity + Io emerges 
from the prism. Naturally, turning the polariser about its axis in 
no way changes the intensity of the transmitted beam, By means 
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of the analyser, it may be shown that the Nicol prism has indeed 
transmitted a linearly polarised beam. If the Nicol prisms are 
oriented with their “slits” parallel to each other, the light will 
be transmitted through the analyser as well without change in 
inlensity (disregarding absorption in the material of the polarising 


device). For crossed prisms, i.e., when the “slits” are at right angles 
to each other, light is not transmitted (see Fig. 175). The intensity 
of the light when an angle œ is formed between “slits” is J = 
= 1/,J9cos*a. Thus, the electric vector of the wave arriving at the 
analyser may be resolved into two com- 
ponents—one parallel to the “slit” and 
the other perpendicular to it. Since the 
component that is passed is Æ cos a (see 
Fig. 176), the intensity is proportional to O 
cos*a. 

The first prism will already reveal 
whether or not the light was partially or 
completely polarised. tee 

Using two Nicol prisms, it is not pos- Fig. 176 
sible to distinguish circularly polarised 
light from natural light, and elliptically polarised light from par- 
tially polarised natural light. To do this, the quarter-wave plate 
may be used. If it is placed before the polariser, it in no way affects 
naturally polarised light, but circularly polarised light is trans- 
formed into linearly polarised light. Similarly, a quarter-wave plate 
changes the properties of elliptically polarised light. 


| 
|Epass = cosa 


Analyser aris 
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The method of investigating transparent anisotropic substances 
by observing the behaviour of linearly polarised light impinging 
on them is very widespread. In order not to complicate the problem, 
assume that we are dealing with a crystal plate cut parallel to the 
optic axis. This plate ís placed between Nicol prisms. 

From the polariser, a linearly polarised beam impinges on the 
plate. Remove the plate and place the analyser in a crossed posi- 

26* 
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tion. Light is not transmitted. Now put the plate back. The field 
becomes illuminated, i.e., the beam of light is transmitted through 
the system. There can be only one explanation, namely, the crystal 
plate has changed the polarised state of the beam coming from the 
polariser. By means of the analyser, we may determine the exact 
nature of the change. If upon turning the analyser a new position 
for darkness is found, the conclusion is that the crystal plate has 
changed the direction of the beam oscillations but has left them lin- 
early polarised. If the intensity of the light is not changed by turn- 
ing the analyser, the conclusion is that the plate has transformed 
the linearly polarised light into circularly polarised light. Finally 
if the light is not extinguished by turning the analyser or plate, 
but the intensity of the light is changed thereby, the conclusion 
is that the plate has created elliptically polarised light. 

The changes produced in linearly polarised light depend on two 
things—the mutual orientation of the plate’s optic axis and the 
oscillation direction of the beam coming from the polariser, and the 

2 
= (no — Ne) created by the plate between 
the ordinary and extraordinary waves into which the incident wave 
is divided. 

If the substance placed between the crossed Nicol prisms is iso- 
tropic, no illumination of the field occurs. The phenomenon de- 
scribed above may be utilised in the investigation of anisotropic 
substances. 

Usually, observations are made using crossed Nicol prisms be- 
tween which the plate is rotated. During such rotation, the illumi- 
nation does not remain constant. At every instant, the amplitude A 
of the light emerging from the polariser is resolved into the compo- 
nents A cos p and A sin P, where @~p is the angle between the polariser 
“slit” and the optic axis of the plate. The terminus of the electric 
vector of the wave emerging from the plate describes an ellipse: 

Ex Ey 2ExEy W 
A? cos2p fi A? sin? p -2cospsing cos 6 = sin? ô, 
where 6 is fixed and @ changes continuously. Fig. 177 shows ellipse 
transformations for the case of a quarter-wave plate, i.e., in tlre 
above formula ô = 90°. For different values of p, different polar- 
ised states are obtained. 

Since the path difference ô depends on the w 
tern is coloured when white light is used. If the 
thickness, it will have a single colour that diffe 
ent relative orientation of the plate and Nicol 
certain wavelengths of white light, 


phase difference ô = 


avelength, the pat- 
plate is of uniform 
rs for every differ- 
prisms. Thus, for 
the plate thickness may be 


equal to A , for others it may be equal to x , and for still others a mul- 
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tiple of 4. Accordingly, for different wavelengths there arise differ- 
ent polarised states, which are transmitted by the analyser to 
a greater or lesser degree for different relative orientations of the 


plate and Nicol prisms. The phenomenon of chromatic polarisation 
is very beautiful. It is hardly likely that the richness of shades and 
hues observed when the thickness of a plate changes (e.g., in crys- 
tal growth), or when the orientation 
of a plate is varied relative to Nicol 
prisms, may be achieved by any other 
means. 

If a plate is of variable thickness, 
the interference fringes are rainbowed 
when observed in white light. 

In addition to these patterns of fringes 
representing equal thickness, dis- 
tinctive fringes representing equal 
inclination may be observed if the 
crystal plate is observed using con- 
verging rays (iconometric investiga- 
tion). These observations may be made 
on small crystalline granules in the 
field of vision of a microscope. Their 
practical significance lies in the de- À xd i 
termination of crystal symmetry. In particular, it is not difficult 
to determine to which of the following three groups an object belongs: 
1) amorphous or crystalline substances having cubic symmetry; 
2) uniaxial crystals; and 3) crystals having symmetry of lower or- 


Fig. 178 
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the relaxation time, i.e., the time required for the molecules to 
assume the appropriate orientation in the electric field, is of the 
order of one-thousand millionth of a second. Therefore, electric 
oscillations modulated by sound may be transformed into light 
intensity variations. This makes it possible for sound to be recorded 
on photographic film. 


152. Optical Activity 


The ability of certain substances to change the oscillation direc- 
tion of a linearly polarised beam is known as optical activity. The 
phenomenon may be described as follows. Consider an arrangement 
of crossed Nicol prisms with an optically active substance placed 
in the path of a beam. The field becomes illuminated, but the 
illumination disappears when the analyser is turned by some angle a. 
Thus, linearly polarised light transmitted through an optically 
active substance remains linearly polarised, but the oscillation 
direction of the beam changes by the angle a. Experiment shows 
that the change in oscillation direction is strictly proportional 
to the thickness of the layer of substance: 

a=pod. f 
The constant p characterising the substance is known as the specific 
rotation constant and is usually expressed in degrees per millime- 
tre. The phenomenon exhibits dispersion since p is a function of 
the wavelength. Normally, p decreases with increasing wavelength. 

The change in oscillation direction is quite considerable and 
in the case of many substances attains a value considerably greater 
than ten degrees per mm for a number of wavelengths. For water 
solutions of organic substances, the rotation of the polarisation 
plane is a function of the concentration, i.e., @ = ped, where c is 
the concentration. 

What kind of substances are optically 
substance must be composed of structural units which have neither 
a plane of symmetry nor a centre of symmetry among their elements 
of symmetry. In the case of molecular substances, such components 
are, as a rule, molecules. In the case of crystals, in which mole- 
cules are not distinguishable, such components are unit cells. 

Molecules, or cells, satisfying the above conditions, may be en- 
countered in the form of two optical isomers, designated by the letters & 
and J (dextro and levo, or right and left). An object and its image 
in a mirror are optically isomeric. A substance consisting of 
d-molecules (or cells) rotates light to the right, while one consist- 
ing of l-molecules rotates light to the left. By rotation to the right > 
we mean the case in which the analyser must be turned to the right 
(from the viewpoint of an observer facing the oncoming beam of 


active? An optically active 
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light) to restore darkness when the thickness of the layer of sub- 
stance is increased. Reversing the direction of the light does not 
change the sign of the effect. 
Optical activity may occur for substances in the liquid as well 
as in the solid state. It is merely necessary that there be a surplus 
of d- or l-molecules. The orientation of these molecules may be either 
random or uniform. In the first case, the body is isotropic and 
the rotation is the same irrespective of the direction of the beam of 
light. In optically active crystals, the 5 
magnitude of the rotation @ is a func- 
tion of the beam direction relative to 
the crystal axes. 
When molecular crystals that rotate 
light are fused, the structural compo- 
nents remain intact. In such cases, the 
solid as well as liquid substance pos- 
sesses optical activity. An example 
of this is sugar, which, in addition, 
possesses activity in solution. This 
property of sugar is utilised in the 
saccharimeter to determine the amount of sugar in a solution from the 
magnitude of the change in the oscillation direction of a beam of light. 
The situation is different in such crystals as quartz (see Fig. 180). 
The arrangement of atoms in a quartz cell satisfies the necessary 
conditions, namely, it does not have a centre of symmetry nor 
a plane of symmetry. The molecules are not distinguishable in 
a quartz crystal; as a result, the arrangement of atoms changes upon 
fusion. Hence, in fused quartz the necessary structural units are 
not present. Fused quartz is, therefore, optically inactive. 
- One and the same substance, from the point of view of chemical 
the optically active form and 

applies not only to quartz. 
le resemblance to the 


Fig. 180 


composition, may be encountered in 
in the optically inactive form. This 
The structure of an inactive variety bears litt 
structure of crystals possessing optical activity. 

The above is quite understandable in the case of ionic and homo- 
polar crystals. But how can an inactive crystal be formed from 


active molecules in the case of a molecular crystal? This occurs by 
A racemic mixture is a mixture 


the formation of racemic crystals. r d 
of equal quantities of d- and l-molecules. Such a mixture does not 


rotate light since the two opposite effects are equalised. A racemic 
crystal consists of pairs of d- and l-molecules. Every pair consti- 
tutes a centrosymmetric group of atoms. 

Optically active crystals exist in both the d- and l-form. The 
structures of such crystals are identical in the same sense that right 
and left gloves are identical. In the case of molecular crystals, this 
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means that in one case the structure is composed of d-molecules and 
in the other of l}-molecules. Dextro- and levo-quartz, dextro- and 
levo-glucose, dextro- and levo-tartaric acid—all the properties of 
these substances, all the details of their structure, are the same 
except for the fact that light is rotated in different directions. 

Inorganic optical isomers (e.g., dextro- and levo-quartz) are 
-encountered in nature in equal quantities. This is not the case with 
organic molecules, which are important in biology. The French 
chemist Pasteur showed that a number of microorganisms are capa- 
ble of feeding on only a specific optical isomer. 
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How is the phenomenon of optical act 
answering this question, we shall show tl 
beam is equivalent to two circularly polaris 
and levo-rotated. 

Let us write the equations for the electric 
taking into account that there is a phase differ 
‘circularly polarised waves. For the dextro-ro 


ivity explained? Before 
lat a linearly polarised 
ed beams that are dextro- 


vector oscillations, 
ence 6 between these 
tated waye 


Tope Ey coswt and B= Ey sin wt; 
for the levo-rotated wave 


E= Eo cos (wt +ô) and B= — Ey sin (wt + ô). 
The total field has the components 


E,=ES+ EF and Ey = Bi Rt 


To determine the polarised state of the resulting oscillation, let 


« 
us calculate the ratio ze for the total field, Using simple trigonomet- 
ae 
ric transformations, we obtain 


Ey ô 


Te —tan>. 


The ratio is independent of time. It is seen that we are dealing with 
linearly polarised oscillations that form 
z-axis. Q.E.D. 

On the basis of this conception 
oscillation direction is quite easi] 


an angle 2: with the 


z Signifies that the levo-rotated wave 
is lagging behind the dextro-rotated wave 


I the d (or vice versa, depending 
on the rotation direction) by the angle ô 


- In view of this expla- 
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nation, it is clear why a discussion of optical activity has been includ- 
ed in the chapter on double refraction. Here, too, the wave is 
divided by the substance into two components, one moving faster 
than the other and continuously advancing relative to it in phase. 
From this viewpoint, the specific rotation is proportional to the 
difference in refractive index between the dextro- and leyo-rotated 
beams. 

This discussion has in no way advanced our understanding of 
the phenomenon. We have merely given it another (completely equiv- 
alent) interpretation. However, this new approach enables us to 
more easily explain optical activity. Circularly polarised waves 
that are dextro- and levo-rotated travel through a substance with 
different velocities. They encounter different indexes of refraction 
and, hence, different permittivities and polarisabilities. The dis- 
placements of the electron cloud under the action of these two waves 
must differ. One wave experiences more difficulty than the other 
in displacing electrons from their equilibrium positions. If we can 
determine the reason for this difference, the explanation for optical 
activity will have been found. ; 

We know from chemistry that if a molecule has an asymmetrical 
carbon atom, the substance may exhibit optical activity. By an 
asymmetrical carbon atom chemists mean a carbon atom bound to 
each of four different atoms or radicals. 

The angles formed between the bonds of a tetravalent carbon 
atom are approximately equal to those in a tetrahedron. Fig. 184 
shows a molecule containing an asymmetrical carbon atom. The 
radicals or atoms bound to C differ, but their nature is unimportant. 
We see, in the first place, that two different molecules of such a sub- 
stance, which are mirror images of each other, are possible. These 
are optical isomers and cannot be made to coincide. This may be 
easily demonstrated using wire models. ’ ‘ 

Consider a circularly polarised wave travelling along the axis 
of symmetry of the bonds. In Fig. 182 the wave is directed outward 
from the page. Atoms A and B are higher than atoms D and Æ. Let 
us determine the directions of the electron displacements for dextro- 
and levo-rotated waves. Assume the situation is as follows for the 
dextro-rotated wave: When the vector # is directed along ED, in 
the upper “level” it is directed along BA. If that is the case, the 
situation is as follows for the levo-rotated wave: When the vector 
E is directed along ED, in the upper “level” it is directed along AB. 

Examining the figures, we see that the displaced electrons behave 
differently. In the first case, the electrons of atoms A and D move 
simultaneously away from the centre. In the second case, when 
the electrons of A move toward the centre, the electrons of D move 
away from the centre. Such differences will always be found for 
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THE THEORY OF RELATIVITY 


154. Basie Theory 


The theory of relativity, developed at the beginning of this 
century by the great modern physicist Albert Einstein, is based on 
two postulates: 1) the principle of relativity and 2) the principle 
of constancy of the velocity of light. We shall briefly consider the 
essence of these principles, describe the experiments confirming 
them, and discuss certain consequences of the theory. 

The theory of relativity originated with the questioning of the 
existence of a mechanical carrier (ether) for an electromagnetic field. 
The theory of relativity solved this problem and in this sense may 
be viewed as the perfection of electromagnetic field theory. While 
solving the problems posed by electrodynamics, the theory of rel- 
ativity went much further. Its development led to the establish- 
ment of the laws of mechanical motion at velocities close to that 
of light, to the law of the equivalence of mass and energy, and to 
new views on the nature of gravity. Since our discussion will be, 
of necessity, very brief, we are forced to dispense with an histori+ 
cal narration. 

First, as to the essence of the main postulates. The principle 
of relativity states that all laws of nature (and not only the laws 
of mechanics) are the same in all inertial systems of coordinates. 
The principle states that not a single physical experiment could 
discover special properties for one of the inertial systems. All iner- 
tial systems are equivalent. 

The second postulate pertains to the constancy of the velocity 
of light in a vacuum for all inertial systems. From this it follows 
that the velocity of light in the “receding” and “approaching” direc- 
tions must be the same, i.e., that the velocity of light is independent 
of the light source and measuring instruments. 

How do these principles affect o 
magnetic field and its carrier? It is 
mulations of the principles that e 
sound waves, are not analogous. 

Imagine a laboratory isolated from the external world, moving 
rectilinearly and uniformly relative to the stars. In this laboratory, 
measurements are made of the velocity of sound in the direction of 
motion. Theoretically, two extreme cases are possible: in one, the 


ur views concerning an electro- 
not difficult to see from the for- 
lectromagnetic waves and, say- 


a 
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walls of the laboratory are impervious to air, so that the air is car- 
ried along by the laboratory; in the other, the walls are pervious 
to air, the air is stationary relative to the stars, and the laboratory 
moves through the air, i.e., without carrying it along. Assume in 
these two cases that measurements are made of the velocity of sound. 
The velocities are measured by two observers—one moving and the 
other stationary relative to the stars. In each case, the velocities of 
sound relative to these two observers will differ. If the velocity of 
sound in air is designated by c and the velocity of the laboratory 
relative to the stationary observer by v, then, in the case in which 
the air is carried along, the moving observer finds the velocity equal 
to c and the stationary observer finds it equal to c + v; in the case 
in which the air is not carried along, the moving observer finds the 
velocity equal to c — v and the stationary observer finds it equal to c. 

The postulates of the theory of relativity reject both variants 
in the case of an electromagnetic wave in an ether. In experiments 
with light waves, the velocity of light will be equal to c for the sta- 
tionary as well as the moving observer. This means that a station- 
ary as well as a moving ether is incompatible with the theory of 
relativity. Thus, the theory of relativity rejects the possibility of 
viewing the field as a medium in which mechanical displacements 
occur. We must conclude that electric and magnetic fields have 


a real existence. 


455. Experimental Verification of the Principle 
of Constancy of the Velocity of Light 


At first glance, the principle of constancy of the velocity of light 
seems to fly in the face of “commonsense”. Therefore, before discuss- 
ing certain consequences of the theory of relativity, it is desirable 
to describe the direct experimental evidence for its validity. This 
evidence is derived from astronomical observations. 

Astronomers have discovered the existence of so-called double 
stars. A double star consists of two heavenly bodies of approximately 


“the same mass rotating about their overall centre of gravity. We 


have the means to measure the distance between the stars, their 
mass and their velocity; also, to determine their relative motion. 
If the velocity of light depended on the velocity of the star itself, 
the velocity of the heavenly body would be added to the velocity 
of light when this body moved toward a terrestrial observer and 
subtracted when this body moved away from the terrestrial obsery- 
er. In such a case, to the terrestrial observer, the motion during 
one half of the orbit would appear faster than during the other half. 
This effect would be detectible even if the velocity v of the heavenly 
body were one-hundred thousandth of the velocity of light c. 
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3 2 Sa l l 
Thus, for a long distance l, the difference in times —— and eer 


may be so considerable, even for very small values of v, that not 
only is the periodicity disturbed, but a light beam transmitted 
during motion in the “receding” direction may overtake a beam 
transmitted during motion in the “approaching” direction. Then, rota- 
tion of the star would not be visible or would be of a peculiar nature. 
The periodic rotation of double stars may be understood only on 
the basis of the principle of constancy of the velocity of light. 

To be sure, our discussion has dealt with the motion of a light 
source, so that there may remain some doubt regarding the validity 
of the principle of constancy of velocity for the motion of an observ- 
er. Such doubt may be removed by another astronomical observa- 
tion, i.e., observation of the periodicity of the motion of Jupiter’s 
satellites. Measurements of the motion of Jupiter’s satellites may be 
made in two cases—when the light arriving on the Earth from Jupi- 
ter coincides with the direction of motion of the solar system and 
when it is in the opposite direction. The identicalness of the obser- 
vations and the distinct periodicity of Jupiter’s annual motion 
demonstrate the validity of the principle of the constancy of the 
velocity of light in this case as well. 

The most important role in the development of the theory of 
relativity was played by an experiment first performed by Michel- 
son in 1881 with the aid of the interferometer described on page 360. 
This experiment consisted in the following. The locations of two 
mirrors, i.e., the arm lengths J; and l, were selected in such a man- 
ner that the coherent beams into which a light signal is divided 
would require the same amount of time to cover the distances along 
the two arms of the interferometer. This selection is made when 
the interferometer is arranged in such a manner that one of the 
arms is parallel to the motion of the globe in its orbit. The instru- 
ment is then turned 90° and the interference fringes observed for 
possible displacement. 

The results of the Michelson experiment, which was repeated 
many times by Michelson and other investigators, are the following: 
no displacements of the fringes occur and the times required for 
light to cover the distances along the arms remain equal when the 
instrument is turned 90°. This conclusion is based on very accurate 
measurements. 

What is the significance of this experiment? Since the Earth 
moves with a velocity v % 30 km/sec relative to fixed stars, the dis- 
tances covered by the two rays cannot be the same from the view- 
point of a celestial inertial observer. 

Let us examine the paths of the two rays (see Fig. 183). Of course, 
we need only concern ourselves with the portions of the path along 
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which the beams travel separately. The longitudinal beam in the 
“receding” direction must cover the distance of the arm length J; 
and overtake the mirror moving with a velocity vin the same direc- 


Fig. 183 Fig. 184 


tion, Therefore, the distance ct; covered by the beam must equal 1, + 
-+ vt;. The time required for the wave front to reach the mirror is 


wee 
so 
als) 
In the “approaching” direction, the beam covers the distance of 


the-arm length J; minus the distance covered by the approaching 
instrument. Therefore, the distance ctz covered by the beam must 


equal 1; — vtz. Then, 


q= 


ee. 
ane 
c (142) 
The time tı + Tz measured in the experiment is equal to 
2l; 


v 
c (1-5) 
i i the transverse beam (see 
let us direct our attention to t 
Fig 1a), During the total time T, i.e., the time elapsed from the 


instant the beam leaves the centre of the instruments to ie instant 
it returns, the mirror is displaced as shown in the figure. Therefore, 
’ 


T = 


27—1409 


~~ - 
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the distance covered by the wave is 


er=2// b+ (3); 


whence, the time t is equal to 
Qly 


-7 
yez 


In the first measurement, the arms J, and Z, were selected in such 
a manner that the times required for the beams to cover the separate 
paths were equal. Hence, 


QU, 2i & toe 


6 = 2 or Yt _~),, 
v? ENOR, y v2 
e (1—5) TARE VE 
However, in the second experiment, i.e., when the interferometer 
is turned 90°, there is no interference fringe displacement and the 
times remain equal even though the arms 1, and l have interchanged 


places! This is the surprising result of this experiment. 
Thus, if the first arm is longitudinal, 


L=hV/1—2, 


if the second arm is longitudinal, 


v2 
L= 1 EA 
Forv = 0, 4 = lz; but for v + 0, we obtain a remarkable result: the 
length of one and the same segment differs, depending on whether 
this segment is parallel to the direction of motion or perpendicular 
to it, The obtained result is valid for any body and for any distance 
between two points. Thus, the first consequence of the theory of 
relativity is that a body moving relative to an inertial observer 
shortens its dimension in the direction of motion. The transverse 
dimensions remain unchanged. If an observer relative to whom an 
object is stationary finds that the length of this object is 1), an observ- 
er relative to whom this object is moving with velocity v will find 


that its length is 
v2 
l= V 1—5. 


Example. If an object moves with a velocity of 1,000 km/sec relative to 
some “stationary” observer, the length of the object in the direction of motion 
appears to be 1 divided by 1.000005. If the velocity of the object is 


200,000 km/sec, then J 4,34, 
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The length of one and the same segment moving in a specific man- 
ner in different frames of reference, will differ. It is necessary to under- 
stand properly the relative nature of the above contraction. Take 
two rods of the same length lọ and assume that their relative velocity 
is v. Now, assume that there are two observers—one moving with the 
first rod and the other with the second. Insuch a case, the first observ- 
er will find that his rod has a length lọ and the other a length 


lo TEE For this observer, the second rod will be shorter than the 
first. On the other hand, the second observer finds that the second 


S 2 
rod has a length Jp and the first a length by 1— =. For him, the 


first rod is shorter than the second. 

The length of a rod (or, in general, the distance between two 
points) is a relative concept. Of all rod lengths measured in various 
inertial systems, the rest length lọ is outstanding. This maximum 
length of a rod has absolute meaning. 


156. Time in the Theory of Relativity 
In the expression relating the length of a rod at rest to the length 


of a rod in motion, the factor V1 — p? appears, where B = = . This 


factor also appears in analogous formulas relating the values of 
various physical quantities for stationary and moving observers. 
Using an approach similar to that taken in the preceding article 
leads to interesting results as regards time and acceleration, mass 
and force, momentum and energy, density of charge and current, 
field intensities, etc. The formulas of the theory of relativity enable us 
to convert values determined by a stationary observer to values 


. . . Us a 
determined by a moving observer. The ratio B = > isin all cases 


an important criterion of the need for a relativistic correction. 

It is easily seen that B* is comparable to unity only when the 
velocity is very large. Even when v = 100,000 km/sec, V1 — f? is 
only several per cent less than unity. It is, therefore, clear that the 
theory of relativity yields negligible corrections when the motion 
is slow, i.e., in such cases it is not necessary to take into account 
the changes in physical properties with motion. The theory of rel- 
ativity is of particular importance for the microworld, where par- 
ticles having velocities approaching the velocity of light are en- 
countered quite often. 

Let us direct our attention to the consequences of the theory as 
regards time. It turns out that the interval t during which an event 
occurs is also not the same from the viewpoint of two different iner- 


27* 


ae 
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tial systems. Thus, two events occurring simultaneously from one 
viewpoint occur at different times—one earlier and the other later— 
from the viewpoint of another frame of reference. i 

Qualitatively, this assertion follows immediately from the prin- 
ciple of the constancy of the velocity of light. Thus, considera sys- 
tem moving uniformly and rectilinearly relative to another inertial 
system. In one of them, there is located a radiator from which light 
is radiated in all directions. In this system, let us select two points 
equidistant from the source of light along a straight line in the 
direction of relative motion. It is clear that in this system the light 
arrives at both points simultaneously. This is the situation from the 
viewpoint of an observer moving together with the source. However, 
to an observer in the other system the situation appears to be differ- 
ent. To this observer, one point is moving toward the signal and 
the other away. Since the velocity c has the same value for this observ- 
er too (moreover, the same in both directions), from his viewpoint 
the light arrives earlier at the point that is behind. 

A doubt may arise: cannot such a conclusion lead to absurdities? 
One may reason that since the concept of simultaneity is relative, 
it may happen that from the viewpoint of one frame of reference 
a gun is fired and then a wounded bird falls from a tree, but from the 
viewpoint of another frame of reference the bird falls before the 
gun is fired. Careful analysis shows that the relativity of a sequence 
of events is limited by the velocity of propagation of interaction 
(less than c). Therefore, “earlier” and “later” may interchange places 
only when they are not causally related, i.e., when they are not the 
result of interaction. 

A very interesting result of the theory relates to the proper time 
of an object, i.e., the time determined by a clock moving together 
with a given body. If a time t has elapsed according to the clock 
of an observer in a certain inertial system, the handle of the clock 
moving with the object will have advanced by the time 


%=tV 1p. 
This means that a clock moving in any arbitrary manner moves 
slower than a stationary clock.* 

It is necessary to comprehend properly the relative meaning 
of this assertion. If two observers are in different inertial systems, 
‘each will assert that the clock of the other observer is slow. This 
would seem to be a paradox. Let us stop the observers and compare 
their clocks. However, to perform this check, at least one of the 
observers must perform a complete circuit. Upon returning to the 


* This formula has found experimental confirmation in experiments with 
mesons (see p. 597). ? 
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may be compared. But now the deter- 
ve lost their relativity. The observer 
remaining in an inertial system is justified in applying the above 
formula. The observer executing the circuit has undergone accelerated 
motion; hence, the formula To = +V 1 — p° cannot be used by him. 
Thus, after the observer executing the circuit returns and the clocks 
are compared, it turns out that his clock is slow. Moreover, he cannot 
“dispute” this result by reference to the principle of relativity, for 
this principle is valid only for jnertial observers. 


point of departure, the clocks 
minations of the observers ha 


457. Mass 


If the mass of a body measured in a system of coordinates to 
which it is bound is designated by mo, to an observer relative to 
whom this body moves the mass appears to be 


mo 
==: 


OVE 


The quantity mo is k nass and the increase in mass 
with increasing velocity is a nai : awe 8 the Aarne 
tal principles of the theory- The velocity of light ¢ cons itutes 
a limiting velocity for any motion or transfer of interaction. For 
VW. -theemass of al bodyabecomes infinite. Of course, the closer 
a body approaches the limiting velocity, the more difficult it is to 
accelerate it. 3 

The increase in mass with A; 
for the electrons of f-rays aS early as 


Since the electron velocity V and the rati i 
independently (see P- 465), and since the se ae remains 
unchanged, we are able to check thertorg N ortant ole i 
Corrections given by the factor V 1 FE p TA The ariel 
the construction of accelerators of charge par A TA EN Ri 
velocities attained in modern accelerators are so & a 


; È a value of 0.9986; thus, the mass 
Bee ets crib ee rest mass. In all experiments 


omes 60 times heavier t r : 3 
i iti pic bodies, 

C t T rjal conditions with macrosco 

nducted under terrestr N ‘ 


. — B? correction the 
E E T k: check its validity not only for ele- 


i een ible by means of precise astronomical 
mentary particles. Eve baa a the change in mass of the planet 


ob $ i TI R, 
aE a orbital motion explains the small deviations of 
the orbit from an ellipse. _ 


increasing velocity was first detected 
he beginning of this century. 


o < may be determined 
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The momentum formula acquires the following form when we 
substitute the expression for the mass of a moving body: 


mov 
Pe i= 
It should be noted that Newton’s law remains valid if it is writ- 
ten as F = 2 . On the other hand, the formula F = ma will no longer 
be valid in all cases. 
158. Energy 


In Sec. 10, we obtained an expression for the energy of a moving 
body by finding a function that increases as the work expended in 
accelerating the body. Let us repeat these calculations taking into 
account the corrections provided by the theory of relativity. 

The work of displacing a body by a distance dł is 

Fdl= oe dl = dp a vdp, 


where v is the velocity vector. If this work serves to increase the 
energy & of the body, then 


dg =v dp = vd (mv) = mv dv +v? dm. 


Since m= then dm = Aas ; hence, 
ERI v2 £ _ mv dv 
dé = (1 + a) mvdv, i.e., de = pay ' 


Comparing the last expression with the formula for incremental 
mass, we find: dg = c*dm, ile., 


G=mc?. 


We have dropped the additive integration constant, for when m = 0, 
6 must also be equal to zero. 

Thus, the work done on a body serves to increase the function 
& = me", which has, therefore, the significance of energy of the body. 
The fundamental result of this calculation consists in the follow- 
ing: an increase in the mass of a body is accompanied by an increase 
in its energy (and, hence, an expenditure of external energy); 
on the other hand, a decrease in the mass of a body or system is accom- 
panied by a decrease in its energy (and, hence, a transfer of energy 
to its surroundings). There is a direct and universal relationship 
between mass increment and energy increment, for c? is a constant 
quantity. 

But what is the nature of the energy 6? Is it an energy of motion? 
Evidently not. If the body is at rest, € does not equal zero but 


159. Mass Defect 423 


equals myc. Therefore, U=moc* is the rest energy of the body, i.e., 
the internal energy of the body, and the difference mc? — moc? is 
the energy of motion. 

The first part of the last sentence should be viewed as an asser- 
tion that may be verified experimentally. As for the energy of mo- 
tion, me?— moc, this will be recognised as the familiar expression 
for the kinetic energy if the following approximation is used: 


eS aoe pe 
VE r5 
To this degree of accuracy, 
2 2 il Fe akin. uous 
me? — mc? = moc” ( Vine 1) = moc? X rl = 


Example. The internal energy of a body of mass mp = 1 kg is U = moc? = 
= 9 X 10! joules = 2.16 X 108 kcal. This is the equivalent of the quan- 
tity of energy that would be released in the form of heat in the combustion 
of 3 million tons of coal. Even in thermonuclear reactions, only several per 
cent of these tremendous reserves of internal energy are released at present. 


159. Mass Defect 


As was indicated in the preceding article, the expression relating 
rest mass to the internal energy of a body, i.e., U = moc”, may be 
verified experimentally. 

The internal energy of a body consists of the rest energy of the 
component parts, their kinetic energy and:their potential energy 
of interaction. A change in any of these component energies affects 
the value of U and, hence, the rest mass as well. Thus, the rest mass 
increases if the temperature of the body rises, i.e., if the internal 
motion of the system increases. The rest mass also increases if repel- 
ling components of the system approach one another or if attracting 
components move apart. 

It is clear from the above that the rest mass of a system of inter- 
acting particles does not possess the property of additivity, i.e., 
it is not subject to the law of conservation. If a body of rest mass 
Mo consists of N particles, each of mass mo, then My ~ Nimo. The 
difference 


M— Nm = AM 
is called the mass defect of the body (or system of particles). The 
quantity 
eAM 
is called the binding energy. 
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If a system breaks up into a number of components, binding 
energy is released and may be measured. Moreover, the rest mass 
may also be directly measured. Thus, the U = moc? law may be 
verified experimentally. he. 

Numerical examples show that any change in internal energy 
related to a change in the velocity of motion and the interaction 
force between molecules and atoms cannot lead to a measurable 
change in mass. Experimental verification of this theory is possible 
in nuclear physics (see p. 540). : 

a f 4 kg of molybdenum increases by AM = 
= 600000003 gm ther E shorea tS 4,000°C. M 


2. If a steel rod of 128 cm length and 4 cm? cross-section (mass of the rod = 


= 1 kg) is stretched by a force of 8 tons, the potential energy thereby stored 
in it increases its mass by 2 x 1071? gm. 


160. The Principle of Equivalence and the General Theory 
of Relativity 


Let us consider a noninertial system of coordinates moving with 


an acceleration ao. Assume that we wish to describe physical phe- 
nomena in this system. The laws of mechanics in this system will 
appear different than in an inertial system, for F = ma is valid only 


for the latter. A stationary body will have an acceleration —ay 
relative to this system. 


If we maintain the terminology used for an inertial system and 


assume that acceleration is produced by forces, then the “force” 
field—may acting on all bodies in an accelerated system may be 
called an acceleration field and an analogy may be drawn between 
this field and a gravitational field. O id 

In exactly the same manner, we may introduce additional “force” 
fields in considering phenomena in a rotating system of coordi- 
nates and, of course, in the general case. The fictitious force fields 
that we have introduced for the description of motion from the 
viewpoint of a noninertial system of coordinates may be called fields 
of inertial forces. The force—mao is an inertial force. 


The motion of a point having an acceleration a relative to such 
a noninertial system will obey the equation 


ma= F -inertial forces, 


Expressions for inertial forces may be found in textbooks on 
theoretical physics. 


It is important to direct our attention to the theoretical side of 
this question. In noninertial systems, fictitious force fields appear. 
To each such acceleration field there corresponds a fictitious distri- 
bution of attracting mass. Hence, any field created by accelerated 
motion may be interpreted, generally speaking, as a gravitational 
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field. In this sense, we sometimes speak of the equivalence of gravi- 
tation and acceleration. 

Let us consider several simple examples. Assume that we are in 
an elevator falling with an acceleration a. Let us drop a ball and 
examine the nature of its fall. As soon as the ball is dropped it be- 
gins, from the viewpoint of an inertial observer, to fall freely with 
acceleration g. Since the elevator is falling with acceleration a, the 
acceleration relative to the elevator floor is g — a. An observer in 


the elevator can describe the 
a- 


motion of the falling body by 
means of the acceleration 
g’ = g — a. In other words, 
the observer in the elevator 
need not speak of the acceler- 


ated motion of the elevator = 
since he has “changed” the ai, 
acceleration of the gravitation- 
al field in his system. | An 

Now, let us compare two Wij 
elevators. One is suspended i 
over the Earth and the other y 
moves in interplanetary space 
with an acceleration æ rela- Fig. 185 
tive to the stars. All bodies in 
the elevator suspended over the Earth are able to fall freely with 
acceleration g. But bodies inside the interplanetary elevator have 
a similar capability. They “fall” with an acceleration—a te the 
“bottom” of the elevator. The role of bottom is played by the wall 
opposite to the acceleration direction. 

Thus, the action of a gravitational field and the manifestation of 
accelerated motion are indistinguishable. 

The behaviour of a body in an accelerated system of coordinates 
is the same as the behaviour of a body in the presence of an equiva- 
lent gravitational field. However, this equivalence is complete only 
if we limit ourselves to observations over small portions of space. 
Thus, imagine an “elevator” having linear floor dimensions of several 
thousand kilometres. If such an elevator is suspended over the 
Earth, the phenomena occurring in it will differ from those occurring 
in an elevator moving with an acceleration g relative to fixed stars. 
This is clear from Fig. 185. In one case bodies fall obliquely to the 
bottom of the elevator, while in the other case perpendicularly. 

Thus, the principle of equivalence is valid for such volumes of 
Space in which the field may be considered uniform. 

The above qualitative considerations lie at the basis of the gen- 
eral theory of relativity. This theory was also developed by Ein- 
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stein. In it, he sought formulations for the laws of nature, independ- 
ent of the choice of coordinate system. Until now, we have assumed 
that this was possible only for inertial systems of coordinates. The 
principle of equivalence shows that the absoluteness of acceleration 
may be destroyed by a gravitational field. An accelerated system 
of coordinates may be viewed as an inertial system if we introduce 
an equivalent gravitational field. To be sure, as we have just seen, 
such equivalence is limited in time and space. However, Einstein 
showed that this restriction may be removed if, corresponding to 


the introduction of the gravitational field, a change in the geometry 
of the system is introduced. 


CHAPTER XXV 


THE QUANTUM NATURE OF A FIELD 


161. Photons 


On a number of occasions we have indicated that radiation and 
absorption of electrical energy occur in packets, or quanta. The 
magnitude of a quantum depends only on its radiation frequency and 
is equal to hy, where h is a universal constant equal to 6.62 x 10-37 
erg sec. It should be noted that the quantum nature of radiation 
and absorption has been established already for the entire electromag- 
netic spectrum, i.e., from hard y-rays to long radio waves. 

The phenomena of radiation and absorption characterise, in the 
first place, the microsystem interacting with the electromagnetic 
field of a wave. The quantum nature of these phenomena (which 
we shall discuss in detail in Part III) shows that a microsystem has 
distinct energy levels and that the values of these energy levels 
cannot be arbitrary. These facts by themselves would not have led’ 
to the conclusion that this quantum nature is characteristic of an 
electromagnetic field as well as of matter if an electromagnetic wave 
in its interaction with matter did not behave, in a number of cases, 
as a particle. The corpuscular properties of electromagnetic radiation 
are manifested when losses and transformations of electromag- 
netic energy occur. The shorter the wavelength, the more distinct 
the effects. These properties, on the-other hand, are not manifested 
during propagation, scattering and diffraction of electromagnetic 
waves if these processes are not accompanied by energy losses. 

A corpuscle of an electromagnetic field is called a photon. It 
is characterised, in the first place, by the magnitude of its energy: 


E= hv: 
Using the law of equivalence of mass and energy, we are entitled 


to ascribe to a photon the mass 
e hy 


OFE 


Since an electromagnetic field is propagated with a velocity c, it 
must be concluded from the formula m = —“2— that the rest mass 


VEE 


of a photon is equal to zero. 
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Assuming the concept of momentum is applicable to a photon, 
we obtain 


=e hy 
P earl 


It should be recalled that the Lebedev experiments (see p. ae) 
directly demonstrate the validity for light of the formula p = : 


the relationship between the momentum density and energy density 


of an electromagnetic wave. The formula for photon momentum is 
in complete agreement with this result. 


Values of 2, m and p for Photons of Various Types 
of Electromagnetic Radiation 


| A A e m (gm) p (gmem/sec) 
Radio waves 2,000 10-21 erg =0.62X 10-9 ev |1:1 x10712 3.3 x 10782 
metres 
Visible light 6,000 A 3.3 40-12 erg=2 ev (3.6% 10-33 1.4 10-22 
X-rays ... 4 A 


19.8 10-9 erg = 12,400 ev}2.2 10-29 6.6 10-19 


n phenomena. These phenomena are ele- 
gantly explained by the wave nature of the field, but are completely 
inexplicable from the corpuscular viewpoint. 


Thus, consider a simple interference arrangement—two close 


t may be transmitted. The following 


discussed earlier, namely, alternate 
let us close each of the apertures in 
graph on one plate. The result, of cou 
theory), will be different, i.e., 


succession and take the photo- 


rse (from the viewpoint of wave 
there wi 
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This experiment and many others show that it is quite impossible 
to reduce electromagnetic phenomena to only a field pattern or to 
only a system of photons. Each concept is exceedingly fruitful in the 
case of one group of phenomena, but fails in the case of the other. 

During the last few decades, physicists have energetically sought 
ways of reconciling these two contradictory views of electromagnet- 
ic radiation. A field is a reality characterised by continuous values 
of field intensity in space and time; a corpuscle is a reality occupying 
a certain limited region of space at a given instant. These contra- 
dictory qualities are combined in electromagnetic radiation. In 
Chapter XXVII, we shall see that these contradictory properties 
are combined not only in the case of electromagnetic radiation, but 
in the case of matter as well. However, physics has made consider- 
ably more progress in understanding matter than in understanding 
an electromagnetic field. The dual nature of particles of matter is 
described by the Schrédinger equation (see p. 474); interactions 
between corpuscles and waves for such particles are understood 
quite well. 

Unfortunately, the situation is much worse as regards electro- 
magnetic field (radiation) theory, commonly referred to as quantum 
electrodynamics (for a detailed discussion, see p. 556). Such a com- 
plete theory does not exist. In view of the fundamental contradic- 
lions existing in quantum electrodynamics, its partial successes, 
expressed in the establishment of new relationships between field 
and particles, cannot be generalised. Hence, the interrelation be- 
tween photons and electromagnetic field remains unclear. 

The rules of “translation” from corpuscular terminology to wave 
terminology, and vice versa, are based on the following: An electro- 
magnetic wave of length À and intensity J may appear as a stream 
of photons of frequency v = + and intensity J = Nhy, where N 
is the number of photons passing per unit time through unit area. 
The direction of motion of the wave front is the direction of motion 
of the photon. 

We shall not discuss in corpuscular terms the very complex 
problem of the polarised state of light. To do this, it is necessary 
to assume that a photon has a selected direction, or spin (see p. 496 
regarding electron spin). 


162. Photoelectric Effect 
The escape of electrons under the action of electromagnetic waves 


constitutes important confirmation of the indispensability of the 
corpuscular viewpoint. This phenomenon will be considered here 
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from this viewpoint, and again in Sec. 274 when we discuss the 
action of light on metals and semiconductors. r 

Since the escape energy of an electron from a metal (see p. 694) 
is not less than 2.2 ev, the photoelectric effect becomes possible 
when hv > 3.5 X 10-1 erg, i.e., for frequencies of the order of 
0.5 x 10 cycles (A = 6,000 A). 

Einstein proposed that the photoelectric effect be viewed as an 
effect of collision between a photon and an electron. In this process, 
the photon gives up all of its energy and ceases to exist. If A rep- 
resents the work function of the electron, i.e., the work required 
to overcome the binding force between the electron and the sub- 
stance, the law of conservation of energy has the form 

hoa A+, 


e is the kinetic energy of the photoelectron, the electron 
dislodged from the substance. 

The first means of checking the v 
thesis consists in verifying the linear 
tron kinetic energy and frequency 

The photoelectron energy is det 
method. If the surface of the subs 
are dislodged constitutes:a conden 
the circuit in which this condense 
flow when an appropriate bias vol 
This condition is given by 


where 


alidity of the photon hypo- 
dependence between photoelec- 
of incident radiation, 

ermined by the bias potential 
tance from which the electrons 
ser plate, current flows through 
r is connected. Current ceases to 
tage is applied to the condenser. 


mv? 
eUr ==>. 


It should be realised that the greater the depth from which the 
electrons are dislodged, the smaller the velocities. Therefore, cur- 
rent ceases to flow when the electrons closest to the surface are pre- 
vented from escaping. By experimentally determining eU, for var- 
ious frequencies v of electromagnetic radiation, curves of Uy vs. 
v may be plotted. The ideal straigl 


i ht lines obtained are shown in 
Fig. 186. The slope = of the straig = hv — A may be 


ht line eu, 

calculated from other data, providin another j a 
of checking the validity of the theory E na means 
Nevertheless, the above experiment cannot be considered direct 
proof of the photon hypothesis. The Possible objection is that the 
ly accumulate the energy transmitted 
wave. This objection was answered by 
Yoffe and N. I, Dobronravoy. 
hotoeffect by means of a par- 
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ticle of dust suspended between the plates of a condenser. Owing to 
inevitable air friction, the particle of dust carries a charge and, 
hence, its weight may be counterbalanced by an electric field. For 
equilibrium q£ = mg, where m is the mass and q is the charge of the 
particle of dust. In the photoeffect process, the particle of dust loses 
an electron and, hence, depending on the sign of q, changes its charge 
by q -+ e or q — e. The particle of dust is then no longer in equilib- 
rium and begins to move towards one of the condenser plates. To 
counterbalance the dust 
particle, it is necessary 
to change the field. The 
equilibrium condition is 
now 


(qe) Ey=meg. 


U,, volts 


In this manner, Yolffe 
determined the charge of 
an electron. 

Now, let us describe 
the Yoffe and Dobron- 
ravov experiment. Here, 
too, the behaviour of a 
dust particle suspended > 
between the two plates of a condenser was observed, but now the 
goal was different. The anode of an X-ray tube served as one of 

- the condenser plates. A voltage of 12,000 volts was applied to the 
tube and the X-rays were created by an exceedingly weak electron 
stream of about 1,000 electrons per second. 

As is well known, X-rays are created when an electron strikes 
an anode. But what is radiated by the anode? Is it a continuous 
electromagnetic field or 1,000 photons per second? The dust particle 
between the condenser plates enables us to obtain the answer. The 
ee dislodge electrons from the dust particle. But how do they 

o this? 

The Yoffe and Dobronravoy experiment showed that, on the 
average, one electron was dislodged from the dust particle every 
30 minutes. If the X-rays were propagated in the form of a contin- 
uous field, then at each instant the dust particle would have ob- 
tained a very minute amount of energy, insufficient of course to 
dislodge an electron. This energy would have been evenly distribut- 
ed among all the electrons of the dust particle. From the wave 
viewpoint, a quite inconceivable conclusion would haye to be drawn 
from the Yoffe and Dobronravoy observations, namely, that once 
every 30 minutes all the electrons transfer energy to one electron, 
which then escapes from the dust particle. 


Fig. 186 
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The photon hypothesis not only explains the phenomenon qual- 
itatively but quantitatively as well. The dust particle in the above 
experiment consisted of a bismuth spherule having a radius of 3 X 
x 107° cm. It was located at a distance of 0.02 cm from the anode, 
from which X-rays emerged in all directions. The probability of 
a photon striking the dust particle is Se =1,§00,000" 
Since in 4 second 4,000 photons are dislodged, on the average 1 pho- 
ton will strike the dust particle every 1,800 sec (30 minutes), which 
agrees with the experimental result. 


163. Fluctuations in Luminous Flux 


The experiments of S. I. Vavilov devoted to the study of fluc- 
tuations in luminous flux of low intensity provide important exper- 
imental corroboration of the photon theory. 

It turns out that the eye’s threshold of sensitivity to light is 
exceedingly low. The human eye is capable of perceiving approxi- 
mately 100 photons per second falling on the cornea. If the luminous 
flux fluctuates about this value, light will not be perceived by the 
eye when the number of photons drops below the threshold value. 

In the Vavilov experiments, the investigator observed a beam 
of light that was discharged every second for a time interval of 
0.4 sec. When the value of the luminous flux exceeded the threshold 
of sensitivity, the eye perceived every flash of light. When the 
light intensity was decreased, some of the flashes were no longer 
perceived by the observer. The lower the light intensity, the greater 
the number of flashes that were not perceived. Thus, fluctuations 
in the number of photons in the luminous flux were directly ob- 


served. It is difficult to provide more direct evidence of the corpus- 
cular nature of light. 


Other experiments 


uite impossible <plain i rence 
as some statistical distribtitio i aea ue 
Wave properties are inherent in every photon rather than in 
a stream of photons. Thus, a photon can in no way be viewed as an 
“ordinary” particle. 
At this point, we must digress somewhat. 
of the invisible world, we endow elementar 
borrowed from the world of things (mater 


In creating a model 
ry particles with properties 
ials) around us, or, as they 


eS 
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say in physics, from the macroworld. Thus, for example, atoms are 
conceived as spherules. Needless to say, an atom spherule only par- 
tially reflects the properties of a material spherule. Everyone knows, 
for example, that such properties inherent in a material spherule 
as colour, roughness and odour cannot be transferred to an atom sphe- 
rule. The more we penetrate into the microworld, the more difficult 
it is to endow elementary particles with material properties. 
Components of an atom or atomic nucleus and particles of light 
resemble a material spherule even less than an atom does. In the 
case of a photon, we saw that it is possible to combine in a micro- 
particle conflicting properties of the macroworld. Of course, in the 
macroworld, a particle is a particle and a wave is a wave. A particle 
occupies a limited region of space and travels along a definite path. 
A wave is distributed continuously in space and the energy is trans- 
ferred to one or another region from all points in space. For mate- 
rials, these two views are irreconcilable. But we have no right to 
impose the behaviour of materials on particles of the micro- 
world. : 
Cognition of the microworld does not consist in the creation 
of a model resembling the pictures familiar to the human eye. The 
infinite process of cognition consists in the investigation of the 
regularities of phenomena, the determination of objectively existing 
causal relationships between phenomena. In this manner, a complex 
picture of the microworld, whose essence cannot be transmitted by 
any ingenious model borrowed from the macroworld, is obtained. 


164. Kirchhoff’s Law 


It has been experimentally established that two bodies having 
different temperatures tend to equalise their temperatures even 
when the bodies are in a vacuum. The energy exchange occurs by 
means of electromagnetic waves radiated by the atoms of these 
bodies. 

As was indicated above, a specific system of energy levels is 
associated with every atom. When an atom absorbs energy, its 
energy level rises; when it radiates, its energy level decreases. Dur- 
ing every radiation process, an atom releases into space an electro- 
magnetic energy AVmn = Em — En, Where m is the energy level 
before radiation and @, the level after radiation. The radiated wave 
has a frequency Vmn. This wave arrives at the other body and is 
absorbed by it. In this case, the energy level of the atom absorbing 
energy is raised from En to Êm- 

The same thing may be expressed in terms of photons. Thus, it 
may be stated that during every radiation process a photon of elec- 
tromagnetic energy hv is released; during absorption the photon is 


28-1409 
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captured by an atom and its energy serves to raise the energy level 
of the atom. 

All the atoms of the bodies participate in the energy exchange— 
sometimes a photon is absorbed, sometimes a photon is radiated. 
Depending on the random circumstances, the most varied energy 
transitions may occur and, in principle, electromagnetic waves of 
any wavelength may participate in the energy exchange. ; 

Let us assume that the bodies participating in the heat exchange 
form a closed system, i.e., the system of bodies under observation 
is surrounded by an envelope that prevents radiation from passing 
through. Then, after a certain interval of time, these bodies reach 
a state of equilibrium and assume the same temperature. This does 
not mean that electromagnetic radiation ceases. As before, a tran- 
sition will sometimes occur to a higher energy state of an atom and 
sometimes to a lower. But if the equilibrium state has been reached, 
then for each body, at each instant of time, equal quantities of 
energy will arrive and leave. This is true for radiation of any wave- 
length. In general, the radiation arriving at a body is only partial- 
ly absorbed, raising the energy levels of its atoms. The other part 
of the incident radiation is scattered, i.e., reflected, by the body. 

Atoms do not maintain their high energy levels long: in returning 
to their original state, they give up the absorbed energy in the form 
of radiation. If the energy incident on a unit area in 4 sec is designat- 
ed by p, the absorbed energy may be expressed as Ap. The dimen- 
sionless coefficient A, indicating the fraction of energy that is ab- 
sorbed, is known as the absorptivity of the body. Evidently, if 


Ap=6, 
where & is the energy radiated from 1 cm? of surf: 


body is in equilibrium with its surroundings and 
does not change. 


But what is the conditi 
which may have, of course, different ab 


radiations? On the basis of thermodynamical considerations, Kirch- 


possible only if the intensity of the 
n a body is the same for all portions 
one another, Thus, 


ace in 4 sec, the 
its temperature 


1 2 be 


Aly Ay A; 


This relationship is known as Kirchhoff’s law and is valid for any 
wavelength and any temperature. It states that the ratio of the emis- 
sive power of a body to its absorptivity is a constant for any wave- 
length and temperature. 

This means that a body that is a good absorber of certain rays 
is also a good radiator of these rays, and vice versa, Why does the 


+ =p. 
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temperature of water in a bottle coated with silver rise slowly, and 
the temperature of water in a dark flask rise rapidly, under the action 
of solar rays? In the first case there is little absorption of solar ener- 
gy, while in the second there is considerable absorption. Now, let us 
assume both vessels are filled with hot water and placed in a refrig- 
erator. The water in the dark flask cools much more rapidly since 
the better absorber is also a better radiator. 

A striking experiment may be performed with coloured ceramic. 
If the colour of a body is, for example, green, it will zot absorb green 
light. Thus, if we heat a green crock, it is seen that it begins to 
assume a colour complementary to green. 

It should not disturb us that we have applied a law established 
for equilibrium to phenomena involving bodies clearly not in equi- 
librium (the body is at a higher temperature than its surroundings). 
The situation here is exactly the same as in the case of other thermo- 
dynamic problems (cf. p. 164): the laws of thermodynamics are appli- 
cable if every instantaneous state may be viewed as an equilibrium 
state. In thermal radiation phenomena, this condition is always 
satisfied. 


165. Black-Body Radiation 


Kirchhoff’s law has an interesting consequence. Bodies exchanging 
heat by means of radiation receive (for given values of v and 7) 
the same electromagnetic wave intensity from their neighbours, inde- 
pendent of the material and properties of the bodies. For every 
wavelength (or, what amounts to the same, every frequency) and for 
every temperature, experiments yield a universal value for p. Thus, 
there exists a universal function pọ (v, T), i.e., a function of the 
radiation frequency and temperature, characterising the process of 
thermal exchange by radiation. 

The meaning of the function p (v, T) is easily explained. Consider 
a body absorbing 100 per cent of the energy incident on it for all 
wavelengths. For such a perfectly black body, A = 1 and 


=p (v, T). 


The function p (v, T) is the emissive power of a perfectly black body. 
But what kind of body absorbs light of all wavelengths? Of course 
substances such as lampblack (soot) are almost perfectly black. 
However, all such substances fall short by several per cent of the con- 
dition A = 1. A more ingenious solution exists. Imagine a box having 
alsmall aperture. If the aperture’s dimensions are made sufficiently 
small, it may be made perfectly black. This property of apertures is 
well known from everyday observations. A deep hole, an open win- 
dow nonilluminated from within the room and a well are examples 
of perfectly black “bodies”. It is clear what happens in these cases: 


F 28* 
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A beam entering a cavity through an aperture is able to emerge only 
after repeated reflections (see Fig. 187). But with each reflection, 
part of the energy is lost. Therefore, in the case of a small aperture 
and a large cavity, the beam is unable to emerge, i.e., it is com- 
pletely absorbed. 

To measure the emissive power p (v, T) of a perfectly black body, 
a long tube made of refractory material is placed in an oven and heat- 
ed. Through an aperture in the tube, the nature of the radiation is 
studied by means of a spectro- 
graph. The results of such 
experiments are shown in 
Fig. 188. The radiation inten- 


Radiation intensity 


Fig. 188 


sily is plotted as afunction of the wavelength for several tempera- 
tures. It is seen that the radiation is concentrated in the relatively 
narrow spectral interval of 1 to 5 u. Only at high temperatures do 
such curves take in portions of the visible spectrum and begin to 
advance in the direction of short waves. Waves whose wavelengths 
are several microns long are called infrared waves. Since for or- 
dinary temperatures they are the main carriers of energy, we call 
them heat waves. , 
The higher the temperature, the more distinct the maximum of 
a thermal radiation curve. With increasing temperature, the wave- 
length Am corresponding to the maximum of the spectrum is displaced 
in the direction of shorter wavelengths. This displacement obeys 
the Wien law, which is easily established experimentally: 
Nn 2 2,886 
T 
In this formula, the wavelength is expressed in microns 
degrees Kelvin. The displacement of radiation in the di 
shorter wavelengths may be detected when a metal is heat 


temperature increases, the colour of the heat changes fr 
yellow. 


and T in 
rection of 
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We call the reader’s attention to another feature of the radiation 
curves, namely, it will be noted that all the ordinates increase sharp- 
ly with increasing 7. If @; is the intensity for a given wavelength, 
the total intensity of the spectrum is expressed by the integral 


œœ 


R= \ ndh. 


0 


This integral is simply equal to the area under the radiation curve. 
Exactly how rapidly does R increase with increasing 7? Analysis 
of the curves shows that it increases very rapidly, namely, propor- 
tionally to the fourth power of the temperature: 


R=oT" ergs/cm? sec, 


where o = 5.7 X 10-® (Gaussian units). This is the Stefan-Boltz- 
mann law. 

Both laws are important in the determination of the temperature 
of hot bodies at great distances. In this manner, the temperature of 
the Sun, stars and the ball of fire of an atomic explosion are deter- 
mined. F 

The laws of thermal radiation are basic to the determination of 
the temperature of smelted metal. The operation of optical pyrome- 
ters is based on the selection of the heating for an electric bulb fila- 
ment in such a manner that the luminosity of this filament becomes 
the same as the luminosity of the smelted metal. We make use of 
the following law: if radiations are the same, so are the temperatures. 
As for the temperature of the heated filament, it is directly propor- 
tional to the electric current flowing through the filament. Hence, 
it is not difficult to calibrate an optical pyrometer. Since actual 
bodies are not perfectly black, it is necessary to introduce in each 
case in the Stefan-Boltzmann formula a factor less than unity (the 
absorptivity of the given body). These factors are determined empiri- 
cally and are significant in heat engineering where problems of heat 
exchange by radiation are extremely important. Nevertheless, the 
above laws are valuable since the general behaviour of radiation 
(dependence on temperature and wavelength) is maintained for 
bodies that are not black as well. The theoretical aspects of black- 
body radiation are discussed in the following article. 


166. The Theory of Thermal Radiation 


Let us consider a cavity within which absorption and radiation of 
electromagnetic waves occur. It is immaterial whether this cavity 
is in the form of a sphere or a rectangular parallelepiped. The walls 
of the cavity radiate and absorb equal quantities of energy, i.e., 
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the entire system is in equilibrium. Within the cavity there is an 
electromagnetic field which is in equilibrium wih the walls: at all 
points the energy density of the field, w = za (Z? +H’), is con- 
stant in time. 

This electromagnetic field may be viewed in two different ways. 
From one viewpoint, there are standing electromagnetic waves in 
the cavity, just as there are standing sound waves in a closed room 
with sound sources. From the other viewpoint, in view of the quan- 
tum nature of the field, it may be stated that the space under consid- 
eration is filled with photons, just as a vessel containing gas is 
filled with molecules. 

From the wave viewpoint, the number of frequencies of electro- 
magnetic oscillations occurring in the cavity may be easily deter- 
mined. The reasoning used for sound waves (see p. 136) is completely 
applicable here too. The number of characteristic frequencies of 
electromagnetic oscillations less than v is equal to 


3 


where c is now the velocity of electromagnetic waves and V is the 
volume of the cavity. This formula gives the number of oscillations 
for the case of linearly polarised waves. In the case of thermal radia- 
tion, we are dealing with nonpolarised oscillations, which may be 
always resolved into components along two axes. 

ere, the number of oscillations is twice as large and is equal to 


v' . hres . * . P 
3 % al. Differentiating, we obtain the number of oscillations in 


the frequency interval from v to v + dy: 


BOVE dit 
c 


on from the “other side of the coin”. 
s filled with oscillations of frequen- 
ns of energy s = hy. The expression 


umber of photons in the cavity, 


8nay? 


2 
and Sar dv as the density of the photon gas. We shall soon be able 


to answer the following important question: Wh 
netic energy density in the cavity? If photons of 
ated in equal numbers, it would merely be n 


e by Sa dv to obtain the ene 
interval dy, However, 


at is the electromag- 
all energies were cre- 
ecessary to multiply 


rgy density for frequencies in the 
the particle energies are not distributed uni- 
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formly. Therefore, the formula being sought has the form 


eW (e) dv, 


8av2 


Wy dy = z3 


where W (e) is the probability of a photon of energy e being created. 
Thus, the electromagnetic energy density for waves (photons) 
of frequency v is given by the formula 


Sav? 
T3 


D= 


eW (8). 


The energy flux through a unit area, i.e., the Poynting vector K, 
is c times wy (see p. 325). But the energy flux p radiated from a unit 
area of a body in equilibrium with the field is one-fourth of the value 


of the Poynting vector: p=. Thus, between w and p, the fol- 
lowing relationship exists: p= pu. What is the origin of the coef- 


ficient Ly Since, on the whole, this is not a very important matter, 


+ 
the following simplified explanation should suffice. 
Every unit area radiates an energy flux p in all directions within 
the limits of a hemisphere, i.e., a solid angle 2x. Thus, the average 


radiation within a unit solid angle is equal to £. From geometry 


considerations, it is clear that the radiation is equal to zero in the 
plane of a unit area and a maximum along its normal. If the decrease 
in radiation intensity were uniform, to obtain the average value 


Fit would be necessary for the radiation along the normal to be 
equal to £ 5 


Now, consider a sphere filled with radiation. At the centre of the 
sphere there is a unit area through which there is an energy flux K. 
On the other hand, however, the radiation falling on this area from 


all parts of the sphere is equal to £x 4x. Hence, o=5K. 


Thus, using the formula for the volume density of electromagnetic 
radiation, we obtain an expression for the emissive power of a black 


body by multiplying wy by =: 


Qnv2 
Py = G2 


eW (e). 


Further investigation of this function involves evaluation of 
W (e), the energy distribution probability. Historically, the first 
formula for py was proposed in 1911 by Rayleigh and Jeans independ- 
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ently of each other. It has the following form: 


QnkT a 
es c2 


This formula was obtained by assuming uniform distribution of ener- 
gy per degree of freedom, i.e., W independent of e. It is valid for 
long wavelengths and high temperatures. 


Another possibility for W (e) is to use the Boltzmann law, which 


e 
was so successful in the case of molecular gases. Then, W (e) =e TT, 


However, as may be seen from Fig. 189, both the Wien emissive pow- 
er formula, 


2av2 oe 
2s RT 
qe hye £ 


Py = 


and the Rayleigh-Jeans formula do not 
results. 


Where is the fallacy in reasoning in these cases? It must be sought 
in the inapplicability of the statistical reaso 

of Boltzmann’s law to 
emphasised, photons give us a one-sided Pictu 
ic field. The reality of the field cannot be completely represented by 
a collection of particles. It is, therefore 

should have their “own statistics” 


agree with experimental 


nergy distribution replac- 
ake proper account of the 
akes the concept of a differ- 
ical particles meaningless, Bose-Einstein sta- 


asis (see p. 684). Tt leads to the fol- 
istribution: 


1 
Ww (e) = Te 
RT, 


e —i 


Therefore, the formula for the emissive 


Power of a perfectly black 
body has the following form: P 7 
— 2ny2 hy 
Pe ine hy 
Pr ars 


This formula was first obtained b £ and is named after him. 
The excellent agreement between thi retical formula and exper- 
imental results, and the nature of the 


deviations of the Wien and 
Rayleigh-Jeans formulas, are illustrated in Fig. 189, os 


Py 


-— Rayleigh- Teens 
— Wien 
— Planik (coincides 
with experiment) 


- 


Per cent deviation 
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The Wien and Stefan-Boltzmann laws considered above follow 
from Planck’s formula. To prove the first of these laws, it is necessary 


to solve the problem for an extremum, i.e., find the root of the 
equation 


pv __ 
a == (0). 


To prove the second law, it is necessary to find 


$ Py dv. 


We leave these calculations to the reader. 


PART THREE 
STRUCTURE AND PROPERTIES OF MATTER 


nn  — 


CHAPTER XXVI 


STREAMS OF CHARGED PARTICLES 


The simplest form of matter is an aggregate of charged elementary 
particles—electrons and ions. Systems of charged particles are en- 
countered in the form of beams of particles in which all the particles 
have a common velocity and move in a single direction and in the 
form of a gas in which the particles move randomly. Intermediate 
states are, of course, also possible. In this chapter, we shall consider 
the basic physical phenomena of such systems and describe equipment 
in which beams and gases of charged particles are used. Problems 
of electron emission, directly related to the physics of solid bodies, 
will be discussed in Chapter XX XVII. 


467. Motion of Charged Particles in Electric and 
Magnetie Fields 


A force f = eH + <[vB) is exerted on a charged particle in an 


electromagnetic field (see p. 267). If the fields # and B are given as 
functions of coordinates and time, and if the initial velocity and lo- 
cation of the particle are known, then for a particle moving with 
a velocity v < c the particle trajectory 7 (t) may be determined from 
the fundamental law of mechanics: 


dy 
nas = ips 


It is usually mathematically difficult to obtain an exact solution to 
this problem. An idea of the general nature of motion in a field may 
be obtained from an examination of the motion of a charge in a uni- 
form field. 

A Particle in an Electric Field. Assume that a particle enters 
a field at an angle 90° + a (see Fig. 190). For the choice of coordi- 
nates shown in the figure, the equations of motion take the form: 


dvx 
a nË and —; 2o 
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Whence, 


e 
u= TESA and vx = Vox. 


Integrating again, and assuming « = 0 for t = 0, we obtain 

: ‘ , 
j= -54 Et? +-vpyt and z= voxt. 

Eliminating, we obtain an equation of a parabolic curve which 
describes the motion of the electric charge (dotted in Fig. 190). 

If the particle enters the field at 
right angles (vo, = 0), ils path is 
described by the equation 


x2 
vä 


y=—3ŻE 


If the particle enters the field along 
a line of force, it will continue to move 
along the line of force with an accel- 
. e 
eration —E. 
m 


Fig. 190 Designating the potential difference 
between the initial and final posi- 


tions of the charged particle by V and using the kinetic energy 
equation, we obtain 


V= = (v?— ve). 


If the final velocity v > vo, then 


mv? e 
eV =—— andv=/ 2 £F. 
2 m 


This equation helps to make clear why the unit of energy known 
as the electron-volt is widely used: 


fev = 1.63 x 107 erg, 


An electron-volt is the work done in moving an electron through 
a potential difference of 1 volt. This unit may be conveniently em- 
ployed when the energy refers to a single elementary particle. The 
work of ionisation and the dislodging and escaping of an electron 
from a metal range from several to several score of electron-volts. 

A Particle in a Magnetic Field. The properties of the force acting 
on a charged particle in a magnetic field are well known (see p. 268). 

Assume a particle enters the field with an initial velocity vo. Let 
us resolve this vector into the components vj and vi, which are 
parallel and perpendicular to the field, respectively. Then, for motion 
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SN 
u 


in the plane perpendicular to the field, we obtain 
ma=—v,B. 
Pinel 


In the longitudinal direction, the particle will move uniformly with 
a constant velocity vy. 
5 j r er v? 
Motion in the perpendicular plane is circular, and a = F is the 
centripetal acceleration. Thus, 


E = mvi 
oe vB xia | Tie 
moye w $ " s 
Hence, R=—7 > i-e., the radius of curvature is directly propor- 


tional to the particle velocity and inversely proportional to the 
magnetic induction. It should be noted that all particles of a given 
kind in a given field will 
have the same angular 
4 3 eB 

velocity, i... © = n" 
Irrespective of the mag- 
nitudes and directions of 
the velocities, the parti- 
cles will have the same 
frequency of revolution Fig. 194 

about a flux line. 

If a particle enters the field at an incline to the direction of the 
field, it will move with a frequency o in a spiral of radius R (Fig. 191). 
Knowing vy, the projection of the velocity on the direction of the 
flux lines, we may determine the pitch of the spiral: 


uf , 20 2nme 
2=yT =v, Par zB Vije 
It is significant that the quantity vy = Vvo cos a, where œ is the angle 


formed between the initial velocity vector and the direction of the 
field, is constant to a high degree of accuracy even when the angular 
spread of the initial velocities is 5-10° (vy will vary by no more than 
1 per cent in such a case). Therefore, every z centimetres such 
a divergent beam of charged particles will converge in a point, i.e. 
focus (within the indicated limits) on a generating line of the eyl- 
inder on which the spiral trajectory may be viewed as winding. This 
generating line passes through the point at which the particle enters 
the field. 


Example. Assume that an electron, after being accelerated by a volt 
= 300 volts, enters a magnetic field of flux density B = 500 gauss ntaga 
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The secondary ionisers in the gas are electrons, not ions. The latter 
are able to ionise gas molecules only when the velocities are very 
high, i.e., greater than those usually prevailing. If the ions do not 
produce ionisation, the removal of the external ioniser stops the 
discharge even when the number of ion pairs due to impacts is hun- 
dreds or thousands of times larger than in the primary ionisation. 
Every avalanche is initiated by a primary electron, and since the 
electrons migrate toward the anode, the discharge ceases in the 
absence of an external ioniser as soon as all the electrons arrive at 
the anode. 

Such highly dependent discharges possess the following property: 
for a given voltage, the strength of the electric current passing 
through the gas is proportional to thenumber of primary ions creat- 
ed per unit time by the external ioniser. The ratio of the gas-ampli- 
fied current to the saturation current created by the primary ioni- 
sation may reach a value of several thousand. This property of 
a discharge is utilised in ionisation measuring devices—proportional 
amplifiers (see p. 532). 

An electric discharge may become self-perpetuating, i.e., continue 
after the external ioniser is removed, only if the ions also become 
creators of charged particles. This always occurs when the voltage 
is very high, i.e.—as indicated above—when the ions can ionise 
upon impact with gas molecules. In such a case, the ions will 
create more and more new electrons—the primary sources of ava- 
lanches. R 


However, a self-perpetuating discharge may also occu _ consid- 
erably lower voltages if the cathode of the gas-discharge tube takes 
the form of a plate. This is because ions are capable of dislodging 
electrons from a cold cathode. If the ion velocity is sufficient for 
such a process, the condition for a self-perpetuating discharge may 
be formulated as follows: new electrons appearing at the cathode 
must, at the very least, replace the work of the primary ioniser. 

Thus far we have said nothing about the role of pressure. At high 
pressures, the discharge column is compressed and thermal ionisa- 
tion begins. When the pressure changes, the current density distri- 
bution changes and so does the luminous nature of the gas discharge. 
At normal and higher pressures, various kinds of discharges are 
encountered, e.g., silent, arc and spark discharges. In the rare gases, 
glow discharges occur. 

A discharge is said to be silent when a leakage of charge from 
a condenser or other charged body is unaccompanied by sound or 
luminosity. Self-perp2tuating silent discharges—brush discharge and 
corona—may occur on spikes, thin conductors and, in general, 


wherever sharp drops in potential, and, therefore, large field inten- 
sities, exist. 
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At higher voltages, spark discharges occur, i.e., the gas breaks 
down. The breakdown voltage depends almost exclusively on the 
gas pressure in the region between the electrodes. At normal pres- 
sure, the air between spherical electrodes breaks down when the field 
intensity is 30 kv/em. Measurement of the breakdown distance may 
serve as a measure of high voltages. 

An electric arc is a special type of discharge. The current density 
in such a discharge is large even though the voltage between the 
electrodes is low. The distinctive 
feature of an are discharge, usual- 
ly created between carbon elec- 
trodes, is the extraordinary high 
temperature attained by the elec- 
trodes. Therefore, thermionic 
emission from the cathode plays 
a large role in an arc. 

In the rare gases, a glow dis- 
charge has a characteristic form 
for every pressure. The degree of 
rarefication may be determined 
experimentally with great accu- 
racy by mere observation of the 
discharge form. Various types of 
gas discharge forms are shown 


in Fig, 193. 
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Ina gas-discharge tube, an elec- 


tron stream moves in the opposite 7 P= 002 mm of Hg T 
direction to a stream of positive , 
ions. To obtain an ion ray, i.e., Fig. 193 


a beam of ions moving in one 
direction, a hole or canal is made in the cathode. A large propor- 
tion of the ions entering this aperture passes through it and then 
continues to move by inertia. Such beams, called canal or positive 
rays, were known to physicists as far back as the last century. 
A similar method of obtaining an ion stream is used even today. 
First, a substance is transformed into the gaseous state, Then, its 
molecules are ionised and the positive ions removed from the gas- 
discharge region through a cathode canal. è 
A gas discharge is not used to create an electron beam. A so-called 
electron gun serves as an electron beam source. This is a device based 
on the phenomenon of thermionic emission (see p. 695). Heated 


cS 29* 
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metals, as is well known, may serve as electron sources. Thus, 1 cm? 
of tungsten surface heated to 2,400° yields in one second a number 
of electrons corresponding to a current strength of 1 ampere. 

Fig. 194 is a diagrammatic representation of an electron gun. 
To accelerate the electrons, a voltage is applied across the electrodes. 
A tungsten filament (7) heated by an electric current serves as the 
cathode. The anode (2) has the shape of a glass with a round hole in 
the bottom. Electrons emerge from this aperture, which determines 
the divergence and width of the beam. 
The focussing electrode (3) makes it pos- 
sible to obtain beams of electrons which 
are fine and intense (see Sec. 171). 

The problem of obtaining an electron 
beam of maximum intensity for a given 
expenditure of energy is of great engineer- 
ing importance. 

To utilise all the electrons emitted by 
the filament, one must, in the first place, 
accelerate the electrons with a sufficiently 
high voltage. The filament emits a certain 
number of electrons per unit time. All 
these electrons must be drawn away from 
the filament. If the voltage is low, an 
electron cloud which impedes emission 


is formed near the filament. As volt- 
age is increased, the cloud is gradually dissipated and th erm- 
ionic current increases. Finally, we reach a voltage with which 


no electron cloud is formed. A further increase in voltage does not 
result in an increase in thermionic current since saturation has been 
reached. This is the condition required for electron gun operation. 
Thus, a sufficiently high voltage ensures that all of the electrons 
are drawn away from the filament. 

The next problem is to obtain increased electron emission from 
a filament. The emission from thorium and oxide cathodes is many 
times greater than that from tungsten cathodes. A thorium cathode 
consists of a tungsten wire coated with a very thin layer of metallic 
thorium. Thoriated tungsten yields the same current at 1,500° as 
pure tungsten does at 2,400°. An oxide cathode consists of a metal- 
lic base coated with a layer of an oxide of an alkaline earth metal. 
Such a cathode yields the same current at 900° as tungsten does 
at 2,400°. In modern electronic devices, oxide cathodes are 
heated indirectly. The cathode is manufactured in the form of a 


tube in which there is placed a tungsten spiral heated by an electric 
current. 


Fig. 194 
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An electron beam may be controlled by electric and magnetic 
fields. The action of such fields is not restricted to the deflection of 
a beam from its original direction. Thus, a parallel beam of electrons 
may be made to converge or diverge, and a beam diverging at one 
point may be made to converge at another. A “lens” for an electron 
gun is produced by a very simple system of fields. A very important 
branch of science known as electron optics, the most significant 
achievement of which is the electron microscope, has developed 
on the basis of this principle. 

Let us recall the properties of an ordinary double convex lens. 
If an object is placed on one side of such a lens, the image of the 


Fig. 195 


object on the other side will be magnified or reduced in size. This is 
because all rays emerging from an object point gather at an image 
point and, moreover, all image points are located in a single plane 
perpendicular to the axis of symmetry of the lens. The simple geo- 
metric construction of Fig. 195 shows why. the lens operates in such 
a manner: the angle of deflection of a ray which impinges on a lens 
is proportional to the distance % between the axis of symmetry and 
the point of intersection of the ray and the lens. The construction 
has been made for an object point lying on the axis of symmetry, 
but the results are similar for other points as well. It should be stipu- 
lated (as is done in optics) that the discussion is valid if the lens is 
thin and the beam divergent within the limits of a small solid 
angle. r 

We shall now show that electric and magnetic fields having axial 
symmetry may serve as lenses. Such fields may be obtained by means 
of the following: electrically charged plates with a round aperture 
in one of them, cylindrical condensers, loops of current and flat coils. 
There are a large number of systems which may serve as lenses for 
electron rays. However, an example of an electrostatic lens and an 
example of a magnetostatic lens will suffice to clarify the principle. 
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Let us consider a condenser in which a round aperture has been 
made in one of the plates (see Fig. 196). If an electron beam impinges 
on this aperture from the side of uniform field, the beam will be 


il 
Il 


— — 


Fig, 196 


focussed: When an electr 


on reaches the region of nonuniform field, 
a force perpendicul 


ar to the equipotential surfaces, and therefore 
at an incline to the axis of 
symmetry, acts on it. Resolv- 
ing this force into two com- 
ponents, we see that there is 
a radial component urging the 
electrons toward the axis. But 
this does not suffice for the 
system to act as a lens. It will 
also be necessary for the radial 
component of the field to be 
proportional to the distance 
and the point at which the electron 
reaches the plane of the aperture. It may be easily shown that 
this is indeed the case. The radial component of the electric field 
intensity may be expressed in the form 


Fig. 197 


between the axis of symmetry 
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ow 
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where 24 is the field intensity gradient along the axis of symmetry. 


To prove this, consider a small cylinder oriented as shown in 
Fig. 197, where the distance 7-2 is infinitely small. Since there is 
no charge inside the cylinder, ar*dZ, the flux difference between 
the ends Z and2, must be equal to —Z,2ardz, i.e., the flux through 
the lateral surface with the reverse sign. 

Thus, an aperture in a charged electric plate serves as a lens for 
an electron beam. 

Now, let us consider the behaviour of electron rays passing through 
a flat current-carrying coil (see Fig. 198). Such a coil constitutes 
a magnetostatic lens. The electrons move in a spiral and re- 
turn to the axis of symmetry 
after completing one turn ofa 
helix. The focussing properties 
of the coil are evident. It may 
be shown that the deflection 
angle of a ray is proportional 
to the distance of this ray from 
the axis of symmetry. The mag- 
netic coil changes the azimuth 
of the electron trajectory, i.e., 
in such a lens the image of an 
object is turned. But this angu- 
lar displacement does not dis- 
tort the electron-optical image. 

Thus, for an object scattering or radiating electron rays, an “elec- 
tron image” of the object may be obtained if an electrostatic or 
magnetostatic lens is placed in the path of the scattered electrons. 
When a photographic plate or luminous screen is placed in the 
plane of the image, a peculiar “picture” of the object is obtained. 
It is bright at points corresponding to the radiation or scattering 
of many electrons and dark at points corresponding to the absence 
of radiation or scattering in the object. Since a system of electron 
lenses yielding a magnified image of an object can be constructed, 
it is possible to construct an electron microscope. 
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The electron microscope, i.e., a microscope in which the role of 
a light ray is played by a beam of electrons, provides exceptional 
opportunities, not yet fully utilised, to “observe” objects directly. 
This is because the possibilities of magnifying an object are, general- 
ly speaking, unlimited in an electron microscope. On the other hand, 
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an optical microscope provides a magnification of not more than 
2,000-3,000. i ee. 

To understand the reasons for this difference, we must familiarise 
ourselves with the resolving power of a microscope. The question 
arises: What are the conditions for seeing two close points sepa- 
rately? 

Imagine that an ideal point source of light is located in front of 
a slit or round aperture.,When light passes through the aperture, 
a diffraction pattern is obtained. A lens placed behind the aperture 
does not concentrate the rays in a point. On the contrary, a blurred 
circle (or band, in the case of a slit) surrounded by alternately bright 
and dark rings appears. On p. 367, the magnitude of this blur for 
a slit was calculated. The radius of the disk, to which a point diffract- 
ed from a circular aperture corresponds, was given on p. 368. 
It is equal to Ippe 

Every optical instrument must have an aperture of entry—the ‘ 
objective. Diffraction at the objective is inevitable, and any lumi- 
nous point in the focal plane of the instrument js diffused into 
a luminous circle. The angular dimension of the radial blur is equal 


to 1.224, Therefore, its linear dimensions in the focal plane are 


equal to M22 Here, f and D denote the focal distance and the 


diameter of the objective, respectively. In the case of a microscope, 
this formula gives merely the order of magnitude since the object 
is close to the objective and, as a result, the beam of rays cannot be 
considered parallel. But since we are only interested in the qualita- 
tive picture, we shall not go into the fine points. 

If two luminous points observed in a microscope are so close that 
the centres of their. luminous image fields are closer to each other 
than a distance equal to the field radius, these two points cannot 
be distinguished as separate points. 


The limit of linear resolution in a microscope is equal to 1.2244 : 


Since the ratio of the focal distance to the objective diameter cannot 
be made significantly less than unity, a microscope enables us to 
observe two points separated by a distance of the order of a wave- 
length. Thus, when viewing in ordinary light (wavelength of the order 
of 0.5 u), we cannot detect object details smaller than a hundredth 
of a micron. 

What is the magnitude of useful magnification which may be 
obtained with an optical microscope? Imagine that a picture is 
viewed through an ocular, is photographed, then the latter photo- 
graph is viewed through an ocular, ete. It is evident that in this 
manner any desired magnification can be achieved. However, fur- 
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ther magnification loses 
eye that the resolution 


all meaning when it is seen with the naked 
limit of points of a photograph has been 


reached. Thus, if a photograph obtained with an optical microscope 


is magnified so that 0.5-4 mm corresponds 
to one micron, the limit of useful magnifi- 
cation has been reached. Hence, the useful 
magnification of such a microscope is about 
1-2 thousand. 

As will be shown in the next chapter, an 
electron ray has the properties of a wave of 
wavelength 


h 


mv? 


where k is Planck’s constant, m is the mass 
of an electron and v is its velocity. When 
the voltage is equal to 50,000 volts, the 
wavelength equals 0.05 A. But the distance 


between atoms is greater than 1 A. Hence» 
the usefulness of an electron microscope is 
not limited by its resolving power. 

Caleulations indicate that the resolution 
limit of an electron microscope is 2-3 A. 
At present, it is possible to achieve a re- 
solution of 5-6 IN, i.e., a useful magnifica- 
tion of a million. 

It turns out that there is much in com- 
mon between light optics and electron optics. 
In electron-optical instruments, we find the 
same elements and the same principles of 
construction encountered in ordinary optical 
instruments. The main difference (and this 
is not of a basic nature) is that the “index 
of refraction” of an electron-optical lens var- 
ies continuously, since the electric and 
Magnetic fields vary continuously, while 
that of an optical lens varies abruptly 


Fig. 199 


(at its boundary). Fig. 199 is a diagrammatic representation of 
an electron microscope: 7) electron projector, 2) condenser lens, 
3) object, 4) objective, 5) intermediate image, 6) projection lens, 
7) final image, 8) observation window. If we wish to examine an 
image directly, a fluorescent screen may be used instead of a photo- 
8raphic plate. An electron microscope is much larger than an optical 
microscope, requires a source of electric voltage and costs consider- 
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ably more. But this is compensated for by its tremendous resolving 
power. 

The electron microscope portrayed in the diagram employs mag- 
netostatic lenses. A high vacuum, of the order of 410 mm of Hg, 
is created in the system in order to prevent electrons from colliding 
with air molecules. An electron gun produces a beam of electrons 


ZZ za EA 
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Fig. 200 


with an energy corresponding to 50,000 volts. Therefore, the instal- 
lation must include a high-voltage transformer to boost the line 
voltage to the indicated value. 

Different methods of observing an object by means of an electron 
beam exist. Since matter is a very strong absorber of electrons, 
its thickness must be no greater than a fraction of a micron if we 
wish to observe an object in the “window”. When electrons pass 
through a thin layer of a substance, they are scattered differently 
by different portions of it. Fig. 200 illustrates the two methods used 
for electron vision. Only those electron rays transmitted through the 
substance without scattering are allowed to pass, while the scattered 
rays are blocked by a diaphragm (Fig. 200a). In such a case, the bright- 
est parts of the image correspond to those portions of the substance 
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which do not scatter electrons, including those portions where the layer 
of substance is particularly thin. On the other hand, the parts of the 
object which scatter electron rays in profusion are dark. The second 
method is the reverse of the first (Fig. 200b). The object is placed at 
an angle to the axis of the microscope, so that only scattered elec- 
trons are directed through 
tle lenses. It is- evident 
that the roles of bright and 
dark fields in the image are 
now reversed. 

The examination of objects 
in an electron microscope 
is usually performed on a 
base the thickness of which 
is about 0.01 u. Such a base 
is made in the following 
manner. A drop of asolution 
of collodion in amyl acetate j 
is placed on the surface of } Tig. 201 
water. The drop spreads on 
the surface forming a thin film which becomes quite firm after the 
amyl acetate evaporates. A loop made of thin wire is placed under 
the film. The object holder is now ready. This base will appear bright 
for normal incidence of the beam. It will appear dark when the 
beam impinges at an angle. 

If the objects under investi- 
gation are poor sctatterers of 
electrons, they will not be seen 
very well against the common 
background. The objects are 
sprayed with a metal to obtain 
more contrast. The base with 
the mounted object is placed 
in the path ofa stream of met- 
al atoms produced by vaporis- 
ing a metal in a vacuum. The 
£ spray is directed at an angle 

Fig. 202 to the base, and the sample 

5 becomes shaded as shown in 

Fig. 201. When an object is examined with electron rays an exceeding- 

ly bright picture is obtained since electrons are scattered only from 

the parts of the object sprayed with metal atoms. Fig. 202 shows 
how flu viruses look under an electron microscope. 

The examination of objects on a base is particularly important 
in biology and medicine. Bacteria are scooped up with the sample 


h 


hp 
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holder from the medium in which their presence is suspected. It is 
easy to study particles obtainable in a suspended state since they 
can be scooped up with a holder. 

Entirely different methods are used in examining the surface of 
a solid. Under certain circumstances, a solid may be made to emil 


electrons. By passing the resulting 
beam of electrons through lenses, we 
are able to see the surface. However. 
this procedure cannot always be used: 
when there is low emission, when 
the sample cannot be heated, etc- 
Under such circumstances, the rep- 
lica method is used. In this method, 
an object is coated with a thin layer 
of substance, which may be sepa- 
rated from the object and examined 
in the opening of an electron mi- 
croscope. Experiments show that 
such layers consisting of any one of 
a variety of substances, e.g., organic, 
metallic and quartz, form exact rep- 
licas-of the surface under investiga- 


3 tion. A photograph of the surface of 
frosted glass obtained in this manner is shown in Fig. 203. The replica 


method requires meticulous experimentation. It is no easy task to _ 
separate the layer of substance from the object. One of the methods 
used is to dissolve the object without damaging the film. 


173. Electron and Ion Projectors 


By means of an electron microscope, it has become possible to 
perceive large molecules as distinct spots or points. But the means 
are available to achieve considerably more, namely, the shape of 
a molecule may be discerned and a picture of its electron cloud 
obtained. This has been accomplished by means of special micro- 
projectors. 4 

Fig. 204 is a diagrammatic representation of an electron and ion 
microprojector. This consists of a vessel evacuated to 10-8 mm of Hg 
and containing electrodes. The cathode has the shape of a spike the 
point of which has a very small radius of curvature. It is possible 
to create near a cathode having this shape a field of the order of 
10° volts/em. For such a field, electrons are torn away from a cold 
cathode in a radial stream. If an obstacle is located in the path of 
the stream, a dark image appears on the fluorescent screen (or pho- 
tographic plate). If an object lies on the surface of the point, the 


173. Electron and Ton Projectors 461 


magnification is equal to the ratio of the distance between the point 
and the screen to the radius of curvature of the point. Using special 
means, the radius of curvature may be made less than 200 A. 

If molecules of a substance are placed on the point, their images 
appear on the screen. This has been done with phthalocyanide mole- 


cules, the dimensions of which are about 15 A. The form of the 
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Fig. 204 


molecule, its characteristic four-petalled structure, and the concen- 
tration and rarefaction of the electron density were clearly visible 
on the screen. 

Although this method can certainly not be used for all objects 
under normal laboratory conditions, the possibilities of a method 
yielding a useful magnification of more than one million should not 
be underestimated. 

But the resolving power may be increased by yet another order of 
magnitude and, moreover, the clarity of the image may be consider- 
ably improved. This may be accomplished by using an ion beam 
instead of an electron beam, but otherwise employing the same prin- 
ciple of object examination. An ion projector does not differ in prin- 
ciple from an electron projector. The point is given a positive poten- 
tial and when the field is large (10° volts/em) ions may be torn away. 
For this purpose, it is necessary that atoms or molecules be adsorbed 
by the surface of the point either beforehand or during operation of 
the projector. In the instrument shown in Fig. 204, a small quantity 
of hydrogen molecules is introduced into the vessel by means of 
a palladic tube. As soon as neutral atoms (or molecules) settle on 
the surface of the point they give up an electron and then, as positive 
ions, move toward the screen. 
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By means of such an ion projector, it has been possible to obtain 
the image of a tungsten point itself. An image arises owing the fact 
that adsorption of atoms occurs in specific parts of a tungsten crystal. 
In the obtained image, it was possible to discern the lattice period. 
i.e., the resolution attained equalled 2-3 A. 


174. The Electron-Beam Tube 


An electron-beam tube is a widely used device, being an essential 
component of a television set, radar system and oscilloscope. The 
principle of operation of such a tube may be explained by means of 
‘the simplified diagram shown in Fig. 205. We see in the figure an 
electron gun (Z) and two condens- 
ers (2) for deflecting the electron 
beam in two mutually perpen- 
dicular directions. 

Let us consider the application 
of an electron-beam tube to the 
recording of rapid processes. If no 
y voltage is applied to the deflecting 
Fig. 205 plates, the electron beam is direct- 


ed along the axis of the device 
anda spot appears on the luminescent screen. Assume that an alternat- 


ing voltage having a frequency greater than 20 cps is applied to the 
horizontal deflecting plates. Then, the electron beam begins to oscil- 
late in the vertical direction in synchronism with the varying field, 
Since electrons have very little mass, these oscillations will have 
practically no inertia.* The motion of the beam is not perceived 
because the luminous spot moves too rapidly for the human eye to 
follow; moreover, the screen has an afterglow. 

Now, let us consider the second pair of plates, which provide the 
so-called “sweep”. A saw-tooth voltage is applied across these plates. 
Since this second pair of plates deflects the beam horizontally, the 
luminous spot moves, say, from left to right quite uniformly under 
the action of such a voltage. When the edge of the screen is reached, 
the luminous spot rapidly returns to its initial position and the proc- 
ess begins anew. By changing the frequency of the saw-tooth voltage 
within a broad range of frequencies, we can vary the time scale of 
the horizontal sweep accordingly. 

If a sweep voltage is applied to the horizontal deflecting plates 
and the voltage being investigated is applied to the vertical deflect- 


* This absence of inertia is determined by the axial y 


elocity of the electrons. 
Therefore, ti 


record very rapid processes, high voltage oscilloscopes are used- 
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ing plates, a curve of voltage ys. time will be obtained on the screen 
since the horizontal coordinate of the luminous spot is proportional 
to the time reckoned from an arbitrary instant. 

An oscilloscope is particularly useful in the investigation of period- 
ic processes. It is always possible to select the sweep in such a man- 
ner that the curve described by the beam during one run from left 
to right coincides with the curve described during the second and 
successive runs. When the sweep period is fixed, we obtain a stationa- 
ry curve of voltage as a function of time in a given time interval 
(from a fraction of a period to several periods). 

A saw-toothed voltage is produced by a self-sustained oscillatory 
process analogous to that described on p. 100 (the toppling of a tub of 
water). The movement of the beam from left to right is produced by 
continuously and uniformly charging a condenser.* A discharge tube 
is connected to the terminals of the condenser. As long as the poten- 
tial difference across the tube is less than the ignition potential, the 
presence of the tube does not affect the charge on the condenser. 
When the potential reaches a critical yalue,-the condenser rapidly 
discharges and the process begins anew. The Saw-toothed oscillations 
must be synchronised with the periodic process under investigation. 

An electron-beam tube becomes more complex when a modula- 
tor is placed between the cathode and the anode. Such a modulator 
consists of a metallic cylinder, one end of which is covered with 
a diaphragm containing an aperture equal to the size of the cathode. 
A negative potential applied to the modulator makes it possible 
to control the intensity of the beam. At a certain value of voltage 
(the blanking voltage), the beam is completely cut off. Such blanking 
is necessary, for example, during the return trace ‘of the beam. Thus, 
by means of the modulator, the trace of the beam is blanked during 
the return sweep. 

Two variable quantities may be viewed simultaneously on a screen 
if an electron-beam tube is equipped with an electron switch that 
alternately connects the deflection mechanism in one measuring 
circuit and then in another. Double-beam oscilloscopes have been 


' developed for this same purpose. Such an instrument is equipped 


with an electron-beam tube having two independent electron pro- 
jectors and- deflecting systems. A double-beam oscilloscope has in 
addition two separate amplifiers for the voltages being investigated 
and two saw-toothed voltage generators. 

The choice of a proper luminescent screen for an electron-beam 
tube is of great importance. For certain purposes, long-persistent 
Screens are desirable, while for others it is required that the lumi- 
nosity disappear as soon as the beam is switched off. 


* A condenser charges and discharges exponentially, but by using a small 
portion of the exponential curve these processes may be made quite linear. 


Bs 


464 Streams of Charged Particles 


Single-pulse processes may be recorded with an electronic oscil- 
loscope if it is equipped with an auxiliary camera the shutter of 
which is synchronised with the sweep. This makes it possible to 
photograph the screen at the required instant. 


_ 175. Mass Spectrograph 


The fundamental equation of motion of a charged particle, 
d?r HPE 
m-a =e (2 +—[vB)) 5 
shows that the path of a charged particle is determined by <, the 


ratio of the charge of the particle to its mass. Therefore, measure- 
ments of the deflection of a charged particle in an electric and a mag- 


netic field may be used to find =. Since the initial velocity of a par- 


. . e . e p 
ticle is not known, m cannot be determined by measuring the deflec- 
tion in either an electric or magnetic field alone. The general formu- 


8- direction away from us 


% 


Fig. 206 


las for deflection in electric and magnetic fields (Sec. 167) show that 
the path is determined by coefficients containing — and the initial 
m 


velocity. The problem is solved by measuring the deflection of one 
and the same particle in an electric and a magnetic field. 

In the simplest case, it is sufficient to balance the electric and mag- 
netic deflections. For this purpose, the fields should be oriented as 
shown in Fig. 206. Charged particles will not be deflected when the 
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following condition is satisfied: e£ = 4 evH. This experiment 
enables us to determine the velocity of a particle. Now, it is merely 


necessary to measure the deflection produced by the electric field 
or by the magnetic field alone. Knowing the magnitude of the deflec- 


tion of a particle from its rectilinear path, we may calculate ~. 


Measurements of = are of great importance in atomic physics 


as a means of determining the mass of a particle when the charge is 
known. This pertains particularly to the determination of the mass 
of ions. 

An instrument in which the particles of a beam may be sorted 
according to mass, and the composition of the beam according to 


mass may be investigated, is called a mass spectrograph. The mass 
spectrograph proposed by Aston is represented schematically in 
Fig. 207. Its principle of operation may be explained as follows. 
Particles of various velocities are introduced into the electric field 
of a condenser. Consider a group of such particles having the same 


= ratio. Upon entering the electric field, a stream of these particles will 


be divided since fast particles are deflected less than slow ones in an 
electric field. Now, this spread of particles is introduced into a magnet- 
ic field (perpendicular to the page). The sense of the field is such that > 
the direction of deflection of the particles is opposite to that in the 
electric field. Here, too, fast particles are deflected less than slow 
ones. Hence, it follows that at some point beyond the field the divid- 
ed beam of particles will again gather at a point, i.e., become 
focussed. 

Particles having a different 2 ratio will also gather at a point, 
but not at the same one. Calculations indicate that the foci of all 
> lie approximately on a straight line. If a photographic plate is 


Placed along this line, each group of particles will be represented by 
4 separate line. 
30-1409 
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If a mass spectrograph is constructed with great accuracy, its 
resolving power will be extremely high and it can be employed to 
detect the presence of very close isotopes. At first glance, such pre- 
cision may appear unessential since it may be reasoned that the 
masses of isotopes differ by at least one atomic weight unit. But 
while it is true that isotopes of one and the same chemical element 
differ by an atomic weight unit, isotopes of different elements (e-g., 
S35 and A®*) may differ very little in mass. Moreover, it is important 
to be able to determine the mass of complex ions. Such problems 
arise, for example, in connection with the chemical analysis of gas 
mixtures. Different particles may then turn out to be close in mass, 
e.g., CH, and N or N“H, and O". All such problems may be 
solved by means of a mass spectrograph. 


176. Accelerators of Charged Particles 


Actually, all such devices as electron tubes, X-ray tubes and elec- 
tron guns are accelerators of charged particles, but this term general- 
ly denotes installations producing streams of charged particles 
(electrons, protons, deuterons, etc.) moving with velocities close 
to the velocity of light. Such streams of particles are then allowed to 
impinge on matter. The interaction achieved may be used for various 
purposes: investigation of nuclear transformations, production of 
radioactive isotopes, medical purposes, chemical action, etc. The 
role of accelerators in modern science is an extremely important one. 

It is possible, of course, to accelerate a particle to any desired 
energy by making it pass through successive accelerating fields. 
However, such a brute force approach, i.e., the construction of a 
linear accelerator, is not always practical. To accelerate particles 
to energies of only tens of thousands of electron-volts requires a path 
length of the order of many centimetres. But modern physics is 
striving to obtain streams of particles with energies of tens of billions 
of electron-volts. Such a linear accelerator would have a length of 
tens of kilometres. Practically, therefore, such a solution to the 
problem is unacceptable. \ 

Lawrence was the founder of high-energy accelerators. His basic 
idea is that in a single installation particles should be accelerated 
by an electric field and repeatedly made to return to the same accel- 
erating gap by means of a magnetic field. The first accelerators 
operating on this principle became known as cyclotrons. 

A cyclotron is represented diagrammatically in Fig. 208. The 
accelerating chamber may be pictured as a flat circular pill box cut 
along a diameter. It is of large dimensions, made of metal, and its 

two halves—the basis of the accelerating chamber—are known a$ 
Dees. An alternating electric field of period T is applied to the Dees 
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This entire system is placed between the poles of an electromagnet, 
which creates a strong constant magnetic field inside the box per- 
pendicular to its base. 

The magnetic field intensity is determined by the period of the 
electric voltage. That is, the value of the field intensity must be such 
that the period of revolution 


in this field (expressed by the Jeftecting 


Target 


formula T = om) is equal to 
the period of the electric field. Orbis 
When this condition is satisfied, 
charged particles entering the 
accelerating chamber are cap- 
tured by the fields and accel- 
erated spirally with constant 
period 7’. Thus, a particle in 
the gap between the Dees is 
accelerated, traverses half a 
circle in the magnetic field and 
arrives at the accelerating gap 
just when the voltage phase Fig. 208 
changes by 180°. Thereupon, 
the particle is accelerated in the same direction and enters the 
other Dee where it traverses half a circle of larger radius. Each time 
the particle passes through the gap its velocity increases. The particle 
must be deflected from its circular path to eject it from the cyclotron. 
A cyclotron has limited usefulness. As the velocity of a particle 
increases its mass increases and, hence, its period of revolution in 
the magnetic field also increases. As a result, the particle begins to 
lag, i.e., it arrives at the accelerating gap when the phase of the 
voltage has changed by more than 180°. This lag increases until, 
finally, the electric field not only does not accelerate particles, but 
retards them. Calculations indicate that the maximum energy that 
a cyclotron can transmit to a charged particle is given by the formula 
9 / evome® 


Source of 
charged 
particles 


Vacuum chamber 


Radio-frequency . 
generator 


. For protons this amounts to 22 million electron-volts 


(22. Mev), and for a-particles about three times more. New means 
are necessary to attain higher energies. 
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Veksler in the Soviet Union and McMillan in the United States 
elaborated a new concept in the accelerator field in 1944 and 1945 
respectively. This concept may be summarised as follows: Examina 


30* 


— 
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tion of the formula for the period of revolution of a particle shows 
that increased mass may be compensated for by an increase in mag- 
netic field. If such compensation takes place, the period of revolution | 
of.a charge will remain constant. ; sty : 

Let us assume that the magnetic field intensity increases during 
the operation of the cyclotron. Among the thousands of millions of 
particles moving in the accelerating chamber, there are undoubtedly 
some particles whose increase in mass due to increased velocity is 
exactly proportional to the increase in magnetic field. Hence, the 
period of revolution of such particles will remain unchanged. Calcu- 
lations indicate that under such conditions other particles will also 
not fall out of synchronism. The only difference, which is quite un- 
important in practice, is that the energy of these particles will not 
increase in a monotone manner, like the energy of the “lucky” par- 
ticles whose increase in mass is completely compensated for by the 
increase in magnetic field, but will fluctuate about the energy value 
of the “lucky” particles. 

The orbit radius of the “lucky” particles agrees with their energy- 
This is the reason for their “good fortune”. Now, let us consider a 
particle the energy of which is greater than that required for a given 
radius. Such a particle will be retarded owing to an excess of mass 
increment. On the other hand, if a particle has less energy than 
required for a given radius, the mass increment will be insufficient 
and the particle will be accelerated relative to other particles at 
the same radius. Thus, since the mass of a particle increases with 
velocity, particles can, so to speak, regulate their own velocity and 
select voltage phases which serve to correct their motion. That is 
why this phenomenon is referred to as “auto-phasing”. 

It transpires, therefore, that it is possible, in principle, to increase 
the velocity of particles in a cyclotron without limit if the magnetic 
field intensity is gradually increased. In such an installation par- 
ticles must be accelerated in pulses. When the field increases parti- 
cles are accelerated, but the reverse cycle is idle. 

The above method is not the only way to achieve auto-phasing. 
Another approach is to slowly vary the period of the electric voltage. 
The principle is basically the same: an increase in the mass of a 
charged particle results in an increase in the period of revolution in 
the magnetic field; in this case, the regime of the alternating electric 

field is varied so as to compensate for this increase. An accelerator 
in which the period of the electric voltage is slowly increased is 
known as a synchrocyclotron. The path of a particle in a synchro- 
cyclotron is a flat spiral. The farther this spiral extends, the greater 
the energy of the particle. Thus, an increase in energy is associate 
with an increase in the area of the accelerating chamber in the mag- 
netic field. The most powerful accelerator of this type is the synchro- 
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cyclotron of the U.S.S.R. Academy of Sciences. This accelerator 
yields a beam of 680 Mey protons. Its magnet weighs 7,000 tons. 


. The energy limit of a synchrocyclotron is of the order of hundreds 


of Mey since a further increase in energy would result in an unthink- 
able increase in magnet weight. Particle energies of thousands of 
millions of electron-volts (Bey) are attained by means of proton 
synchrotrons. X 
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Proton synchrotrons are basically different from cyclotrons. Since 
a proton synchrotron accelerates particles in a single circular orbit, 
the volume of the magnetic field may be greatly reduced, The entire 
central region of the magnet is, so to speak, cut out. As a result 
much less steel is required for the magnet. Thus, the electromagnet 
of the 10-Bev proton synchrotron of the U.S.S.R. Academy of Sci- 
ences weighs 36,000 tons, but the electromagnet of a 10-Bev synchro- 
cyclotron would weigh about 4 million tons. 

To accelerate particles in an orbit of constant radius, one must 
vary the period of the accelerating electric field and the intensity of 
the magnetic field in a very definite manner. In such an installation, 
particles are accelerated in pulses. Each pulse is obtained by increas- 
ing the magnetic field and decreasing the period of the accelerating 
electric voltage in a prescribed manner. 

For a given orbit radius, a unique relationship exists between the 
field intensity and the velocity: 
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and the relationship between the period of revolution and the veloci- 
ty is determined by the expression 
2nr 


v 
If these conditions are satisfied, a “lucky” particle will be accelerated 


in a monotone manner. 

Since the auto-phasing conditions occur here too, the remaining 
particles follow a path which oscillates about the circular orbit and 
they also take part in the synchronous increase in velocity. Since 
particles oscillate about an average circular orbit, it is necessary 
to make the width of the pathway for the charged particles rather 
broad. Methods are being sought which would allow us to reduce the 
width of this pathway. Success would lead to a reduction in the 
amount of steel required for a given accelerator and make it practical 
to construct accelerators with even higher particle energies. 


410 Streams of Charged Particles 


Thus far we have been discussing accelerators of heavy particles. 
Electron accelerators have a number of special features. 

As far back as 1941, an accelerator known as a betatron was con- 
structed to accelerate electrons. Such an installation operates on the 
principle of a transformer. The winding of a magnet constitutes the 
primary winding and a beam of electrons revolving in a circle of 
constant radius constitutes the secondary “winding”. In other words, 
the electrons move along a circular line of force of the rotational 
electric field produced by an alternating magnetic flux. 

At first glance, it appears that such acceleration may continue 
without limit. The increase in mass with velocity does not limit the 
acceleration since there is no need for synchronism in this phenome- 
non. Nevertheless, a betatron does have a limit. At energies of sever- 
al hundred Mey, energy losses due to radiation become considera- 
ble—an accelerated electron radiates electromagnetic waves. As a result 
of this radiation, the electron path is transformed from a circle to an 
inward spiral and acceleration becomes impossible. Betatrons are 
useful when electrons with energies from 20 to 50 Mev are required. 
If greater energies are desired, electron synchrotrons must be used. 
Such accelerators were first proposed in 1946-47, 

The electron} synchrotron is similar to the synchrocyclotron de- 
scribed above, i.e., itis a resonance accelerator. An accelerating elec- 
tric field is added to the magnetic field of the betatron. The accel- 
erating mechanism is maintained by auto-phasing. But the fact that 
we are dealing with lighter particles, namely, electrons, simplifies 
the construction problem. This is because at energies of only 2 to 
3 Mev the electron velocity is almost equal to the velocity of light. 
Hence, when the energy increases further, the radius of the electron 
path does not change. This makes it possible to construct the magnet 
in the form of a ring as in the case of the proton synchrotron. The 
radiation losses of the electron synchrotron are compensated for by 
increasing the accelerating voltage. 

At- high-energy levels, radiation losses reach imposing values. 
In a 800-Mey accelerator having an orbit radius of 4 metre, an elec- 
tron radiates 1,000 ev per revolution. In a 10-Bev electron synchro- 


tron having an orbit radius of 20 metres, the energy losses per revolu- 
tion would be equal to 30 Mev. 


CHAPTER XXVII 


THE WAVE PROPERTIES OF MICROPARTICLES 


179. Diffraction of Electrons 


Fig. 209 shows X-ray and electron patterns of scattering from the 
same substance. The close similarity of the patterns indicates that 
diffraction also occurs in the case of electrons. If the wavelength 
2 is known, one may determine from a roentgenogram the values of 
the interplanar distances by means of the equation nA = 2d sin 0 


Fig. 209 


(see p. 385). We can calculate 4 by measuring the angle 0 of all the 
vings on the electronogram and using the values of d determined from 
the roentgenogram. The same value is obtained for each ring. This 
shows that the pattern is produced by diffraction and that.a specific 
wavelength is associated with a beam of electrons. 


In order to obtain such a pattern, one must place a thin film of matter in 
the path of an electron beam. Electrons are_easily absorbed by matter and 
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will not pass through a film the thickness of which is more than about 107° cm. 
Electronograms may be obtained by means of an electronograph, an instru- 
ment similar to an electron microscope. In fact, one may use an electron micro- 
scope to obtain diffraction electronograms. For this purpose, it is merely 
necessary to remove the lens. 

Electron diffraction is used for the same purposes as X u 
electronography has a number of advantages over roentgenography. The main 
advantage is the short exposure time required. Matter scatters electrons much 
more effectively than X-rays. An electronogram may 


be obtained in a period 
of time measured in seconds, while a roentgenogram- requires at least several 
minutes. 


Since electron beams do not penetrate matter to any extent, electronography 
may be conveniently employed to investigate the structure of surfaces. Elec- 
tronography may also be used to study the distribution of atoms in crystals, 
assuming, of course, that the structure is not complex. 


-ray diffraction, but 


We are not so much interested in the 
graphy as in electron diffraction itself. It 
determine the wavelength of an electron beam. This may be done 
experimentally. When the acceleratin 


g voltage U is varied, the 
wavelength varies. It turns out that A is inversely proportional to 
V U. Thus, when 2 is expressed in Angstroms and U in volts, 


applications of electrono- 
is of great importance to 
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As far back as 1924, before the discovery of electr 
Louis de Broglie advanced a bold hypothesis. He proposed that the 
concept of the dual nature of an electromagnetic field, its manifesta- 
tion in the form of a wave of frequency v and wavelength ^ as well 
as in the form of particles (photons) with an energy Æ = hy and 


a momentum p = a =4 ; Should be broadened to include particles 
of matter. Experiments in electron diffr 
pothesis. The formula for the electron 


Ay 


on diffraction, 


action confirmed this hy- 
wavelength 


P mv 
is easily converted into the form 
4, — const* 


h 
=7= > Where const* =—“_ 
Wane Vem’ 


by means of the relation m= eU or v= y Ze. 


Substituting the values of e, m and h in the C 


and converting U into volts, we obtain the aboy 
stant. 


GS system of units, 
e value for the con- 
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By means of the above formula, one can obtain the following electron wave- 
lengths for the indicated values of .accelerating potential: 
U (volts) 1 100 1,000 
y (A) 42.95 4-22 0.39 


Since diffraction from a crystal is possible when the wavelength is of the: 


same order as the period of the crystal lattice (~4 A), it is seen that electrons 
the energies of which are of the order of a hundred electron-volts may be diffract- 
ed from a crystal. 


180. The Fundamental Concepts of Quantum Mechanics 


As we have seen, an electron beam may behave as a wave having: 
h ° ° 
a wavelength LS Is this behaviour typical of a large group of 


electrons or are wave properties inherent in single electrons? To 
answer this question, let us compare the electronograms obtained 
with a low and a high intensity beam of electrons. In one such exper- 
iment, the beam was of such low intensity that the average time 
interval between two successive passages of an electron through the 
diffracting system was 30,000 times greater than the time taken by 
an electron to travel from the filament to the photographic plate. 
Nevertheless, the diffraction pattern differs in no way from another 
electronogram obtained with a beam the intensity of which was 
107 times greater. Such experiments show that the wave property is 
inherent in single electrons. Thus, what was stated on pp. 438-39. 
with respect to a photon is also valid for an electron. An electron 
does not behave like a projectile or pellet. The behaviour of an 
electron cannot be described by means of Newton’s laws of mechan- 
ics. The field devoted to the investigation of the behaviour of 
microparticles is known as quantum mechanics. 

This dual behaviour is not peculiar to an electron alone. Wave 
behaviour is characteristic of all microparticles. Thus, it is possible, 
for example, to observe neutron diffraction. Diffraction of helium 
atoms and hydrogen molecules have also been observed. As will be 
seen below, the greater the mass of a particle the more do its wave 
properties recede into the background. But more about this later. 
Let us designate by p the amplitude of the wave associated with 
a microparticle of mass m. This amplitude, like the amplitude of 
a wave of any kind, is a function of coordinates. The function (x, Y, 
z) or “psi function”, must satisfy the wave equation (see p. 341): 


Apt- =O. 


Let us substitute in this equation the microparticle wavelength 
jen , expressing the velocity of the particle in terms of its energy. 
mv 


ER 
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If the particle moves in a field with a potential energy U and has 
a total energy ĝ, then 


mv? 


2 =6—U and v= y 26-0). 


Substituting in the wave equation, we obtain 


822m 


A+ he (6—U)p=0. 


This equation is called Schrédinger’s equation and is the fundamental 
law of quantum mechanics. It should be noted that we have merely 
performed a substitution in the wave equation, which by no means 
constitutes a derivation of the fundamental law of quantum mechan- 
ics. The substitution should be viewed rather as a conjecture leading 
to the discovery of this equation. 

Much like Newton’s fundamental law of motion, the Schrédinger 
equation is a law of nature encompassing, as we shall presently see, 
an extensive range of phenomena.* 

This differential equation enables us, in principle, to find at all 
points in the region under consideration the amplitude p (a, y, z) of 
the wave associated with a microparticle. 

How can the validity of the Schrédinger equation be verified? 
This may be done by means of a basic confirmation of the theory: 
the intensity of the wp-wave at every point in space, i.e., the quantity 
wp’, is the probability density** of an electron at this poi is space. 
As for the amplitude of the wp-wave, it cannot be determi ed experi- 
mentally, like the field intensity vectors of an electromagnetic wave. 

The description of a particle by the wp-function is not to be regarded 
as an incomplete, imperfect method of describing its motion. It 
would be incorrect to believe that it is solely due to the peculiarities 
of quantum mechanics that a particle has a probability wp? (x, y, 2) 
of being at a given point in space and that by means of a better 
theory the path of the particle could be determined and its location 
indicated with certainty. An exact description of the motion of 
a particle, i.e., the determination of its path, is not possible because 
the behaviour of a microparticle is completely different from that 
of a large body. A microparticle is not a particle in the classical 
sense. 


Let us again refer to the experiment with two slits which was used 
to illustrate the dual behaviour of a photon. 


* We have simplified the picture by not considering the dependence of p 

on time. The exact Schrédinger equation takes this dependence into account. 

** That is, the probability of finding the particle in a small volume divid- 
ed by the magnitude of this volume. 
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Assume that a beam of electrons impinges on a screen in which 
two slits are cut. Diffraction occurs. Let us direct our attention to 
one point on the screen where, say, interference annuls the wave. 
If only one slit is kept open, electrons reach this point. If both slits 
are open, however, electrons do not reach this point. This phenome- 
non cannot be explained by the collective behaviour of the electrons. 

If we insist on the electron’s behaviour as a classical particle, it 
must be accepted that an electron passing through one slit “knows” 
whether the other slit is open or closed, and behaves’ accordingly. 
It must be concluded that an electron is not a classical particle. 
Every electron has wave properties and the concept of an electron 
path is not fully applicable. Therefore, the question whether the 
electron passed through one slit or the other when both slits were 
open is meaningless. Such a question is meaningful only for “ordi- 
nary” particles, but not for microparticles. 

Does this mean that it is meaningless to speak of the velocity and 
coordinates of a microparticle? This question is answered by the 
so-called indeterminacy (or uncertainty) principle formulated by 
the German physicist Heisenberg. 
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This principle indicates the limits within which the classical 
description of /particles applies to microparticles. 

Applicability of the Path Concept. Let us assume that we wish 
to determine the coordinate of a microparticle at some point x and 
are able to do this with an accuracy of Az. To “see” a microparticle, 
one must use a “microscope” in which the wavelength of light is 
sufficiently short (the shorter the wavelength the greater the resoly- 
ing power). In principle, Az may be made as small as desired. For 
this purpose, it is merely necessary to reduce the wavelength until 
Az is of the same order of magnitude as the wavelength: Az ~ À. 


However, if the wavelength is short, this means that the corre- 


h s 
sponding photon has a large momentum: p => . This momentum 


will be transmitted to the particle being “observed” under the micro- 

scope, i.e., when the particle is “flicked” its momentum changes 
v ` aay h x 

by a Ap the order of magnitude of which is T . By decreasing 4, we 

decrease Az, the uncertainty of the coordinate, but at the same time 

we increase Ap, the uncertainty of the momentum. Eliminat- 


, 3 A h 4 
ing À from the following relations: Ax ~ A and Ap ~ + , We obtain 


the equation . 
Az x Apxh, 
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the expression for the uncertainty principle. This indicates that spec- 
ifying the path of a microparticle has physical meaning only if it 
is understood that the coordinates and momentum in a given direc- 
tion have uncertainties which satisfy the inequality 


Az x Ap>h. 
This remarkable relation establishes the limits of applicability of 
classical physics to a microparticle. rN, 
Substituting the product mv for the momentum p, which is justi- 


fied in the case of velocities which are not very close to the velocity 
of light, we obtain the condition 


h 
(Men 5% Av eh 
a relation between the uncertainties in a coordinate and the corre- 
sponding particle velocity along the z-axis. The right member of the 


relation will vary by many orders of magnitude, depending on 


whether we are dealing with an electron, atom, molecule or tennis 
ball. i 


For an electron 
h 


— 7 cm?/sec, 
m 


hence, the uncertainties in a coordin 
ticle velocity are related as follows: 
Az x Av>7. 
Consider an electron located within an atom, the diameter of which 
is 10-8 cm. Can the motion of the electron in the atom be described 
as if the electron were an “ordinary” particle? Using the uncertainty 
principle, we find that Av ~ 10° cm/sec. Thus, we can only speak of 
the velocity of an atomic electron in very general terms. The con- 
cepts of an electron path in an atom, an electron path of transition 
from one energy state to another (see below), etc., are meaningless. 
In short, an atomic electron is quite different from an “ordinary” 
particle. 
Now, let us assume that 
chamber and that we wish t 
. order of several tenths of a 


ate and the corresponding par- 


o trace its path with an accuracy of the 
soe tee re i millimetre. If the width of the partie’ 
rack 1s At = 10 cm, then accordin sertai rinciple 
Rowen e Thee g to the uncertainty princip 
the velocity. Even if the electr 

tainty of the indicated order is insignificant and specifying the 
electron path becomes meaningful, Similarly, we can speak of the 


I microscope and the path of electrons 
In an electron-beam tube without coming into conflict with the 
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The mass of protons, neutrons, atomic nuclei and atoms is thou- 
sands of times greater than that of an electron. Therefore, the classi- 
cal paths of such particles are of somewhat greater usefulness. Thus, 
for example, in the case of an a-particle, the mass of which is about 
7,000 times greater than the mass of an electron, 


Az x Av > 10°. 


Is it meaningful to ask at what location in an atom did the path 
of an a-particle penetrating a substance pass through the atom? 
We wish to trace the path with an accuracy of 10-° em and we have 
at our disposal data on the lateral component of the velocity the 
uncertainty of which is 10° cm/sec = 10 km/sec. For a fast -particle 
(20,000 km/sec), this uncertainty is insignificant. Therefore, it is 
possible to say whether the path of the «-particle penetrating the 
atom passes far from the centre of the atom or not. 

On the other hand, it is meaningless to speak of the path of pro- 
tons or neutrons in nuclei, since the size of a nucleus is ~ 10- cm. 

For large molecules; e.g., proteins having a molecular weight of 
the order of 10°, the uncertainty principle is of no significance. 
Here, Az x Av > 107; hence one can reliably define the path of 
such a molecule in considerable detail. Even the random thermal 
motion of such a molecule, the average velocity of which is of the 
order of 4 m/sec, may be traced, and the path of its centre of gravity 
indicated with an accuracy of the order of 4 A. 

Needless to say, a particle of dust, even if it is visible only under 
a microscope, is too large for the uncertainty principle to be of prac- 
tical significance. 

The Simultaneous Measurement of Two Physical Quantities. 
It should not be thought that the inability to determine the path 
of a particle is due to a measuring deficiency which will eventually 
be overcome by physicists. The lack of meaning in specifying a pair 
of physical quantities with ideal exactness is a peculiarity of micro- 
particles. The methods of describing the behaviour of an “ordinary” 
particle are inapplicable to a microparticle. Only for a classical par- 
ticle is it meaningful simultaneously to specify and define its coordi- 
nate and momentum. 

The. principle of uncertainty has broader significance than being 
simply a means of judging whether or not the path of a particle 
can be determined. As an integral part of the mathematical appara- 
tus of quantum mechanics, the principle of uncertainty enables us 
to evaluate the possibilities of simultaneous measurement of any 
physical quantities, not only coordinate and momentum. 

First, let us define the uncertainty principle in relation to coordi- 
nate and momentum. It should be recalled that a particle has three 
Coordinates and that a momentum yector has three components. 


ei ld 
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Instead of the one relation discussed above, three relations should 
be written, namely: 


Az Xx Apz>h; Ay x Apy>h; Az x Apz>h. 


The possibility of simultaneously determining (specifying) all the 
coordinates along the different axes, and all the momentum compo- 
nents, is also considered in quantum mechanics. It turns out that it 
is possible (meaningful) simultaneously to specify the coordinates 
or simultaneously specify all three momentum components. Why 
is this point being emphasised? It would seem that it is always pos- 
sible simultaneously to determine the three components of any 
vector. Careful consideration indicates that this is not so. An exam- 
ple of a vector the three components of which cannot be determined 
simultaneously is the angular momentum (i.e., moment of momen- 
tum) of a particle. 

Assume that a particle rotates about an axis with an angular mo- 
mentum L. This motion may be viewed as the resultant of three 
rotations about three mutually perpendicular axes with 
moments L,, Ly and L,. ln the case of an “ordinary” particle, the 
three components of the angular momentum may be determined 
separately since the path of the particle may be traced. In the case 
of a microparticle such a- determination is not possible, and simul- 
taneous specification of all three components of the angular momen- 
tum is meaningless. To clarify this point, let us assume the converse 
for a moment, namely, that all three components of the angular 
momentum are known. But then the total angular momentum could 
be constructed from the three components. In that case, the plane 
in which the particle moves is determined. But if this plane is 
known, then we know precisely the coordinate of the particle along 
the axis of rotation and note simultaneously that the velocity of 


translatory motion along the axis of rotation is equal to zero. This 
contradicts the principle of uncertainty relating coordinate and 
momentum. 


angular 


Thus, it is characteristic of a microparticle th 
simultaneously to determine the three components of its angular 
momentum. What data relative to its rotation may be specified 
simultaneously? The uncertainty principle gives the following 
answer: any component and the absolute value (vector length) of the 
angular momentum. There is one exception to this rule: complete 
absence of rotation may be established for a microparticle, i.e., the 
angular momentum vector may equal zero; in other words, all three 
components equal zero simultaneously. 4 

Energy and Time ‘Interval. On the basis of the uncertainty prin- 
ciple relating the coordinate and momentum of a p 


] article, one may 
suspect that a more or less analogous relation which involves energy 


at it is not possible 
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exists. Thus, in ordinary particle mechanics, it was necessary to 
know simultaneously the position and velocity of a particle in order 
to calculate the total energy as the sum of potential and kinetic 
energy. For a microparticle this is not possible. However, the total 
energy of a particle may be found as a whole, i.e., without separa- 
tion into parts, and on the basis of what was just said it is natural 
to expect that this may be done with an uncertainty A@. If it is 
assumed that the uncertainty principle maintains its form, then 
from dimensionality considerations we must conclude that the uncer- 
tainty relation for energy must have the following form: 


AGAT Yh 


where At is the time interval. 

What is the significance of this time interval and how should the 
uncertainty relation for energy be interpreted? The time interval 
At is the time during which a microparticle possesses an energy 
€ + A. The uncertainty in the energy of a microparticle is deter- 
mined by the time during which it is in the given energy state. The 
uncertainty in energy becomes significant only when the time during 
which it is at the given energy level is measured in minute fractions 
of a second. a 

An atomic electron remains for an indefinitely long period of time 
at its lowest, or fundamental, energy level (see below for a detailed 
discussion). Therefore, the energy of the fundamental state is fixed 
quite regidly. An electron remains for a very short period of time 
at a higher level. Its energy in such a state is @ + A @. Accordingly, 
when an atom passes from a higher energy level to a lower one, the 
radiation frequency cannot be exactly defined, i.e., it lies in the 


narrow band v =: a This may be observed experimentally: spectral 


lines are of finite width, which may be. used to determine the so- 
called lifetime of a microsystem in an excited state. Experiments show, 
for example, that the width of spectral lines in the X-ray region 
is of the order of 10 ev. Thus, in such a case, the lifetime in an excit- 


h 
ed state is of the order of ae 10-16 sec. 


182. The Potential Square Well 


On page 57, we discussed potential curves, which ç, 
the conditions of particle motion. The simplest 
is a right-angled curve, a so-called square well. 
potential energy has a constant value over a sẹ 
ity, we restrict ourselves to the linear case). 
well, the potential energy changes abruptly. 
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high, the potential energy inside the well may be considered equal 
to zero (since the choice of origin is immaterial), and at the edges 
of the well equal to infinity (see Fig. 210). A 
Assume that an electron (or some other particle) is located in the 
well. Let us try to determine the nature of its motion for the simple 
one-dimensional case, i.e., let us assume that the electron is moving 
along the z-axis. If Newton’s laws of mechanics were applicable to 
an electron, such an electron would 
U move continuously—first toward one 
side of the well, where it would be elas- 
tically reflected from the wall, then 
toward the other side, etc. There is no 
other possibility from the viewpoint 
of Newtonian mechanics, since for 


2 
U = 0 the kinetic energy a is con- 


stant. Thus, for motion in a square 
well, the following conclusion may be 
drawn from the mechanics of “ordinary” 
particles: A particle may move in the 


. . ó, 2 
well with any kinetic energy ee or 


it may remain motionless. For any 


tgiven energy, the motion is uniform— 
first toward one side, then oward the other, i.e., the velocity re- 


verses direction abruptly at the end of the allowed interval, 

Now, let us consider the electron motion in such a well from the 
viewpoint of quantum mechanics. 

The behaviour of the electron is characterised by the ap-function. 
The square of this function indicates the probability of finding the 
electron at some point in the given interval. Since U = 0 inside 


the well, the Schrédinger equation becomes simplified and may be 
written in the form 


Fig. 240 


dip 402 
wie =0, where 4=—Ż 


V mB ` 


y the sine and cosine of the argument 
=. If the well is bounded by the coordin 


then for these values p 


This equation is satisfied b 


ates z = 0 and z = 4, 


= 0, since the electron does not penetrate 


the walls. Therefore, cos 2x $ is not suitable as a solution to the 


equation (cos 2a = 1ate= 0). Hence, 
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But the wavelength % cannot be arbitrary, since p = 0 at x = a. 
The following equation must, therefore, be satisfied: 


2n 
z7 (n-+-1)x 
or 
oft any) 0, 4, 2 
i herr On 
i ser where n= 45 25... 

n 2a a 
Thus, the wavelength may assume the values 2a, a, Se hair a 


It is seen that the p-function represents the amplitude of a standing 
wave (see p. 132) and that formally this problem has much in com- 
mon with that of a vibrating rod or string. But if the wavelength 
4 has a discrete set of values, then the energy § of a microparticle 
cannot be arbitrary, i.e., 

g= erne. 


8ma? ? 


the integer n is called the quantum number. 

Thus, the Schrödinger equation leads to the quantisation of energy. 
A microparticle in a square well has a discrete set of energy levels. 
The lowest energy level occurs for n = 0. This energy is equal to 


2 
in and is the zero-point energy of a particle located in a square 
well. 

The existence of a zero-point energy is an interesting peculiarity 
of microparticles. In the case of “ordinary” particles, the lowest 
energy has a value of zero. But under no circumstances can micro- 
particles cease to move. At absolute-zero temperature, a micro- 
particle has a specific zero-point energy, the values of which differ 
considerably depending on the nature of the force field in which 
the particle is located. 


Example. Assume that a=1Å (a characteristic value for an atomic 

region). Then, the zero-point energy of an electron in the Square well is 
h2 (6.6 x 10-27)2 

60 Sma? ~ 8X9.1 X10-5 (10-92 


0.6 10-10 erg =37 ey. 


If a=1 cm (a free electron in a piece of metal), E0=0.6% 10-26 erg = 37x 
x 10-16 ey. 


The velocity of an electron at a given energy level may be calculat- 


h : 
ed by means of the wavelength: =: But in such a case, the 
electron motion cannot be described by the equations of classical 


mechanics. It is not possible to indicate where the electron is located 
31-1409 


re 
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at one or another instant of time. However, the quantity 4°, i-e., 
the probability density of the electron at one or another point in 
space, may be determined from the equation 


pn = 4A? sin? = 
It should be noted that to each energy level there corresponds 
a proper (of same number n) wave function (also called characteris- 
tic function or eigenfunction). 
Fig. 241 shows the »p-function and p*-function for the first four 
energy levels of an electron located in a square well. Quantum 


x= A? sin? = (n+ 1) a. 


yz) 


Fig. 211 


mechanics leads to the conclusion that an electron does not appear 
with equal frequency at different points in the region. If the electron 
energy is a minimum, i.e., the electron is located at the lowest 
level (n = 0), the particle is most often encountered at the centre 
of the “well”; if the electron is located in the state corresponding 
to n = 1, it is never encountered at the centre of the allowed seg- 
ment, etc. The ph curves indicate the frequency with which the 
electron appears at various points in the region. 

Now, let us summarise. On the basis of quantum mechanics, the 
following conclusions may be drawn regarding the motion of a mi- 
croparticle in a square well. Only motion corresponding to a discrete 
set of energy values, $o, G1, 62 .. ., is possible. The particle can- 
not be stationary since even the lowest energy level corresponds tO 
motion with a certain velocity. Data on the nature of particle mo- 
tion for a given energy are provided by the y-function. Knowing 
ap’ (z), one may determine at what points in the region the micro- 
particle appears often, and at what points rarely. 
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It remains to be determined under what conditions an “ordinary”, 
i.e., classical, description of particle behaviour is valid. 

Imagine that an oxygen molecule is enclosed in a box the dimen- 
sions of which are scores of times greater than the dimensions of the 
molecule. Assume that the molecule has the average energy, at room 
temperature, of a, molecule of oxygen gas, i.e., 107 erg. Using the 
values a@= 100A, m= 5.4 x 10°8 gm and ¢ = 1078 erg, one 
may determine the quantum level of the microparticle. The result 
is n = 1,000. Two conclusions may be drawn. First, the ph curve 
has such a large number of alternating maxima and minima that 
the probability density of the particle is approximately the same 
for all points in the box. Secondly, neighbouring energy levels are 
very close. 

Both characteristics which follow from the fundamental equation 
of quantum mechanics have disappeared: the probability distribu- 
tion of the particle is practically indistinguishable from a smooth 
curve and the energy levels are so close that energy discreteness is 
imperceptible. In such a case, quantum mechanics yields approxi-. 
mately the same results as particle mechanics. This is true whenever 
the particle energy corresponds to a large quantum number. We 
have illustrated an important principle of quantum ‘mechanics: 
when the quantum number is large, the results of quantum mechan- 
ies coincide with those of the mechanics of “ordinary” particles. 
This means that when z is large the particle-path concept and other 
characteristics of ordinary particles are applicable to microparticles 
as well. 


183. Significance of the Solution of the Schrödinger 
Equation 


We have devoted a relatively large amount of space to the motion 
of a particle in a square well. On the basis of this simple example, 
we were able to illustrate the basic features of the quantum-mechan- 
ical method of considering problems. If an electron or other particle 
is able to execute motion in a limited region, the characteristic 
features of the solution of the Schrödinger equation are preserved, 
no matter what the form of the potential curve in this region. In 
all cases, the potential well may be intersected by a number of hori- 
zontal lines— possible energy levels. In principle, the Schrödinger 
equation enables us to calculate these energy values if the form of 
the potential well is given. The lowest level gives the zero-point 
energy of a particle in a given potential well. 

For each energy level of number z, quantum mechanics establishes 
a set of wave functions pr (x, Y, 2)- The quantity pr (x, y, z) gives 
the probability of finding a particle at a given point in the region if 


i 31* 


front of the barrier or beyond the barrier. 
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the energy of the particle is @,. Since the particle manages lo be at 
all points in the region more than once during the time of meas- 
urement, p? (z, y, zZ) may be viewed as the density of the “particle 
cloud”. The electron cloud surrounding an atomic nucleus resembles 
a photograph of the atom taken with long time-exposure. The yp-func- 
tion gives the amplitude of the wave associated with the particle. 
In the case of an electron in a square well, the p-function consists 
of standing waves and to every level there corresponds a character- 
istic wavelength A. : 

This is not the situation in the general case. The standing “waves’ 
corresponding to a given state (given n) will appear very strange: 


their wavelengths A= Va will differ at different points 


in the region, depending on the nature of the potential “curve” 
U (x, y, z). For more or less complex cases, there is only slight simi- 
larity between the ‘p-function and the amplitude of a standing wave 
(in the usual sense of the term). 

Theoretical and experimental evidence indic 
of cases several p,-functions may correspond to a single ¢, energy 
value. This occurs if at one energy a particle may have states which 
differ with respect to another physical quantity, e.s., angular mo- 
mentum. The forms of the xp? clouds of such a state—called a degen- 
erate state—may differ radically from one another. 

The problem of particle motion in a given type of potential well 
is solved when the energy levels are found and the characteristic 
tp-functions are calculated for all levels. If the solution of the Schré- 
dinger equation is known, the result of one or another measurement 
performed on the given particle may be predicted. 


ate that in a number 


184. Tunnelling Through a Barrier 


We shall now discuss a peculiar effect which may occur in the case 
of a microparticle, but not in the case of an ordinary particle. This 
is the tunnel effect, i.e., the “leakage” of a particle through a poten- 
tial barrier. 

Imagine that inside the region in which a particle moves, there 
is a potential barrier of height U and width d (Fig. 212). If the ener- 
gy of the particle is @ < U an ordinary particle could be either in 

The particle cannot pass 
through the barrier, since this would require that the particle have 
a negative kinetic energy and an imaginary velocity, which is absurd. 
Matters stand differently with respect to microparticles. The uncer- 
tainty principle does not permit us simultaneously to ascribe to 
a microparticle exact values of velocity and coordinate, and hence 


` 
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of kinetic and potential energy. That is why a particle having a total 
energy 6 may pass through the barrier. 

The conditions for such passage may be determined as follows. 
Coordinate and momentum uncertainties are connected by the*rela- 
tion AxApæh. Momentum uncertainty 
is uniquely related to kinetic energy 

3 
uncertainty since K = L. If AK isa 


quantity of the order of U — &, where 
6 is the particle energy and U is the 
height of the barrier a particle to the 
left of the barrier (see figure) has a co- 
ordinate uncertainty 


Nop oir fee 
i Ap w 2m (U—@) : 
If the barrier width d is less than Az, Fig. 212 


the particle may be located on the 
other side of the barrier. The particle tunnels its way, so to speak, 
through the barrier at the total energy level @. 

Thus, the condition for tunnelling is 


dV Im(O—6) <h, icn +VImU—S <1. 


It may be easily shown by numerical examples that the phenomenon 
is of significance only for microparticles. 

For U—€=10 ev~10-" erg, m~ 10-27 gm (electron mass) and d~ 
~ 10-8 cm, £ Vm (U=) =0.2<1, i. e., tunnelling ‘is possible. 

For a 1-gm spherule lying next to a match box (U —'6 = 3,000 ergs and 
d=2 cm), Vim (T= a) =2.5X108 > 1. It is evident that the spherule 
cannot “tunnel” through the match box. 


The probability of leakage through a barrier may be calculated. 
t turns out that this probability is proportional to 

-42 Vim(=éya 

eiet 4 

The tunnel effect could be predicted on the basis of the Schrö- 

dinger equation. The solution of this equation shows that even at 
Points where U > & the -function has values differing from zero: 
Thus, there is a certain probability—inversely proportional to the 
magnitude of U — € —that an electron is located in a region where 
in the language of “ordinary” particles it possesses negative kinetic 
energy, 
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ATOMIC STRUCTURE 


185. Energy Levels of a Hydrogen Atom 


A hydrogen atom has one electron which “rotates” in a nuclear 
field. One would think that the problem is simple. But even for this 
most simple atom, the solution of the Schrédinger equation is very 
cumbersome and, therefore, 
will not be reproduced here. 
However, we shall give the 
results of the calculations 
and discuss them in con- 
siderable detail. 

An electric force of Cou- 
lomb attraction acts be- 
tween the electron and the 
nucleus. The potential ener- 
gy of an electron ina nu- 


clear field is U=—“& 


7 
where e is the charge of an 
electron (the same as the 


charge of a proton) and 


a VEFEZ is the distance between the electron and 
the nucleus. 


The Schrödinger equation has the form 


’ 


Fig. 213 


822m 


_ byt (e+) p=0. 


Such an atom constitutes a peculiar kind of potential well and is 
illustrated in Fig. 243. This is a well without a bottom and with 
divergent sides. The sides of the well are hyperbolas and the axis 
PAO) is one of the asymptotes. The electron inside the atom has 
a negative potential energy* since the minimum value of potential 


* It may be asked: why has the origin of potential energy been chosen in 
such amanner that the electron energy is negative? The advantage of such 
a choice is not difficult to see. For different atoms, the potential Sy has the 
pame value only when r —> co. It is natural to set this common value equal tO 
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energy tends to infinity when r— 0 and the maximum value is 
equal to zero. r. 

Fig. 214 shows the energy levels obtained from the solution of the 
Schrödinger equation. An important feature of the solution is the 
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drawing together of the levels as the quantum number n increases. 

ransitions between levels will be discussed below. The scales of 
Values, which are proportional to energy, are given in the units 
adopted in spectroscopy: volts and reciprocal centimetres. The energy 
level formula may be written in the form 


202me4t 
Cn semana 


w. 
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For historical reasons, it is customary to write this formula in the 
form AI 
chh 

n= — 


pa s 


2eme = 109,740 em! is the Rydberg constant. 

Thus, it transpires that the total energy as well as the potential 
energy of the atomic electron is negative. The atomic electron may 
be located at any one of n levels. The greater the value of'n, the 
higher the energy level, and the greater the energy possessed by the 
electron. The electron of a free hydrogen atom on which no force 
acts is at the lowest energy level: 6; = — cRh. 

If an energy greater in magnitude than cRh is transmitted to the 
atom the electron leaves the bounds of the potential well, i.e., the 
atom becomes ionised. The energy cRh is called the ionisation energy. 

It is customary to characterise the work of tearing away an elec- 
tron from an atom by the ionisation potential. For a hydrogen atom, 


y cRh _ 3x 1010 x 109,740 x 6.6 x 10-27 
ton i 4.8 X 10-10 


= 4.5 x 107 statvolt= 13.5 volts. 


where R = 


The reason for the above designation is the following. Let us as- 
sume that the electron is torn away from the hydrogen atom by the 
action of a beam of electrons. To ionise a hydrogen atom, one must 
accelerate electrons, which act as projectiles, to an energy of at 
least eV = cRh. Therefore, V is the potential difference through 
which an electron must be accelerated in order to produce ionisation 
of collision with a hydrogen atom. 

If the energy imparted to a hydrogen atom is less than cRh, 
a transition of the atom occurs to one of the n levels*. Such an atom 
is said to be in an excited state. 

An atom stays in an excited state for a small fraction of a second 


and then passes to a lower level with the emission of a photon in 
accordance with the equation 


hymn =8m— En =c Rh ( fis. E ) A 


n2 m? 


If hydrogen atoms are excited by different kinds of collisions, they 
are raised to different energy levels and return to the ground state 
by “skipping” over levels (see Fig. 214). Therefore, a large concentra- 
tion of hydrogen atoms will radiate photons of every possible Vmn 
frequency. A characteristic line spectrum of emission arises. 


* In the case of a hydrogen atom, which has one 
atom is at an energy level n” and “the electron is a 
the same meaning. 


electron, the phrases “the 
t an energy level n” have 
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By calculating for a given z the v,, frequencies corresponding to 
the numbers m = n + 1, n+2,... , we obtain a series of frequen- 
cies of lines in the hydrogen spectrum. The existence of such series 
was known long before quantum- mechanics was developed. The 


Ba 


Fig. 215 


series corresponding to n = 2 is known as the Balmer series. By sub-- 
stituting m = 3, 4, 5, 6, 7 and 8, respectively, in the formula, let 
us calculate the wavelengths of six of the lines in this series: A; = 
= 6,562.80 A; A, = 4,861.38 A; 4, = 4,340.51 A; A, = 4,101.78 A; 
Ay = 3,970.11 A; and As = 3,889.09 A. It is apparent that the 
separation between lines decreases asm increases, which is in accord- 
ance with experimental results (see Fig. 245). Experimental and 


calculated values do not differ by more than 0.05 A. 


186. Quantum Numbers 


The solution of the Schrédinger equation enables us to determine 
all of a hydrogen atom’s energy levels, n, as well as all of its wave 
functions. In the ground state, an electron is characterised only 
by the function yy. As for the excited states, they are degenerate to 
the square of n, to use the terminology of quantum mechanics. This 
means that there are four w-functions corresponding to the energy 
62, nine corresponding to @3, etc. Eeach of these states may actual- 
ly exist. 5 

How do the n? states having the same quantum number n differ 
from one another? Quantum mechanics provides the answer to this 
question. States with one and the same energy value n may differ 
with respect to the magnitude of the electron’s angular momentum 
as well as the value of the angular momentum’s projection on a cer- 
tain selected axis. A 

The solution of the Schrödinger equation for a hydrogen atom 
shows that the electron’s angular momentum has a discrete series of 


~ 
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values given by the formula 
PAA i 
L=V TEDER 


where / may assume any integral value from 0 to n — 4 when the 
electron is at the n-th level. 

Moreover, the Schrödinger equation shows that relative to the 
selected direction z the angular moment Z must be oriented in such 
a manner that 


h 
L,=m>— r 


where m is an integer that may assume any value from —Z to +l, 
including zero. 
It should be recalled that according to the uncert 


L and L, give us all that we can possibly know about the angular 
momentum; in other words, it is 


meaningful to specify simultane- 
ously only these two quantities. 


Thus, the state of an electron in an atom is characterised by three 
quantum numbers: n, Z and m. The number n is called the principal 
quantum number, J the reduced azimuthal quantum number, and 
m the magnetic quantum number. 

The states with Z = 0, 4, 2; 3, .. are designated by the letters 
Sy JORIS Te ech Mn respectively. The principal quantum number pre- 
cedes one of the above letters. For example, the 3p state is the state 
with n = 3 and 1 = 1, 


Let us list all of the possible states for n = 1,2 and 3: 


ainty principle 


a oa [eea] n 
1 0 1s 0 
2 0 2s 0 
4 2p —1,0,4 
3 0 3s 0 
1 3p EO 
2 3a —2, —1, 0, 4; 2 


The energy transitions of a hydrogen atom are determined exclu- 
sively by the values of the pr 


incipal quantum number z. In order 
for the J and m numbers to play a part, the degeneracy must be 
“removed”, i.e., the energy of states with a different angular momen- 
tum must be changed. In the case of hydrogen atoms, this may be 
done by placing the i 


l atoms in a magnetic field. In other cases, de- 
generacy is removed by electron interaction (see below). 


| 
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187. The Electron Cloud of s and p States 


The state characterised by the three numbers n, 2 and m is de- 
scribed by the wave function pn, z, m: The characteristic form of the 
electron cloud corresponding to this state is determined by the func- 
tion 3,1, m. Let us consider the form of the p?-functions of a hydro- 
gen atom which characterise its various excitation states. 

Consider the s states. Since Z = 0, m is also equal to zero. Hence, 
for every n there is only one ap-function: The equation l = 0 means 


Fig. 216 


that the electron has no angular momentum. This requires, of course, 
that there be no favoured directions of motion, i.e., the electron 
cloud must have spherical symmetry. Such is the result obtained 
pam the Schrödinger equation: the functions Wpis, Pos, Was, ete., 
lave spheri mmetry- 

Fig. O16 Re cane of radial density distribution of the electron 
cloud or, what amounts to the same, the probability density distri- 
bution of the electron. The quantity 4su%p?, which gives the radial 
density, is plotted along the ordinate. It is evident that 4su*y?dr 
represents the number of electrons* contained in a spherical shell 
the inner and outer radii of which are r and r + dr, respectively. 
The radial density curves show that in the 4s state there is one 
electron density maximum, which for a hydrogen atom is located 


* A fractional number of electrons should not disturb us, for this is only a 
Manner of speech. Strictly speaking, nrp? dr is the probability of an elec- 
tron being inside a spherical shell of dr thickness. 
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at a distance of 0.53 A from the nucleus. In the 2s state, there axe 
two density maxima; the electron will be within the second maxi- 
mum most of the time. Finally, in the 3s state, there are three den- 
sity maxima; here, the electron will be within the third maximum 
most of the time. 


As the principal quantum number n increases, the electron cloud 
becomes dissipated. 


The p-state functions look entirely different. There are three 
values of m, namely, 0, —4 and +4, corresponding to Z = 4. The 


m=+] x m=-1 


Fig. 247 
electron-cloud configurations are illustrated in Fig. 217. For m = 0, 
the major axis of the “figure eight” is oriented along the selected di- 
rection; for m = + 1, the major axis is perpendicular to the selected 
direction. It is evident that the m — = 1 states may be meaning- 
fully distinguished only when both are present. The figure gives 
some indication of the symmetry of the electron cloud. It is the 
same for all states. A change in the principal quantum number 
merely results in a change in the nature of the radial drop in den- 
sity: the greater the value of n, the more extended the curve. 


We shall not discuss states with large values of Z. Their electron 
clouds are more complex. 


188. Pauli’s Exclusion Principle 


Atoms are arranged in the Mendeleyev periodic table in accord- 


ance with the number of electrons contained in them. Thus, helium 


has two electrons, lithium three and beryllium four. On the basis 
of the Schrédinger equation, what can be said about the structure 
of atoms? 


At first glance the problem may appear hopeless. Even in the 
case of helium, strict adherence to procedure would involve solving 
the Schrédinger equation for a wave function with six variables, 
P (Zi, Yis Zis Le, Yo, Z2), the square of which gives us the probability 
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of finding the first electron at point 2, Y1, 2; when the second electron 
is al point 22, ys, Z2. The potential energy to be substituted in the 
equation is 


where r; and rz are the distances of the electrons from the nucleus 
(the nuclear charge of helium is 2e) and r,s is the distance between 
electrons. An exact solution of such a problem is quite impossible. 

It would be extremely desirable to deal separately with each 
atomic electron and describe each such electron by its wave func- 
tion 1p (x, y, z). But how can this be done? Evidently, it is necessary 
to consider the motion of one electron in the field of the nucleus 
and the remaining electrons. It may be assumed that this effective 
field has spherical symmetry. Therefore, the description of the 
properties of such an electron will not differ from that of the electron 
of a hydrogen atom. 

To be sure, the problem is still quite difficult: for different elec- 
trons these effective fields differ and, moreover, all must be deter- 
mined simultaneously since each depends on the states of the 
remaining electrons. Such an effective field is said to be self-con- 
sistent. This approach to the problem of a multi-electron atom 
enables us to apply, to a large extent, the description of the prop- 
erties of a hydrogen atom’s electron to the behaviour of an electron 
of a complex atom. 

The state of each electron will be characterised by the same quan- 
tum numbers as in the case of hydrogen. However, in the case of 
a multi-electron atom, degeneracy is removed by electron interaction, 
and levels with different / and m values will have different energies. 

The Schrédinger equation enables us to determine the energy 
levels that are possible, but does not indicate the energy of the 
atomic electrons. One might think that all the electrons of an atom 
occupy the lowest energy level. In any case, such would be the 
behaviour of “ordinary” particles. But experiments completely refute 
Such a supposition. The “arrangement” of electrons in accordance 
With energy levels is governed by the Pauli exclusion principle. 
The first conjecture that such a principle exists was based on a study 
of the Mendeleyev periodic table. 

As was indicated above, it follows from the Schrödinger equation 
that (22 + 1) states exist for a given n and J. It was also indicated 
that this in turn yields n? different wp-functions for a single value 
of n. The first values of n? are 1, 4 and 9. Let us examine the Men- 

eleyey periodic table. Helium, neon and argon, which complete 
the first three periods of the table, contain 2, 8 and 18, i.e., 2n2, 
electrons, respectively. This is by no means accidental. It is an 


| 
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expression of a profound law according to which only two electrons 
can have the same p-function. In other words, an energy level cannot 
be occupied by more than two electrons. This general law of nature, 
to which we shall return in Sec. 190, is called the Pauli exclusion 
principle. . 

By means of this principle, we can “arrange” the electrons ol a 
complex atom in accordance with quantum numbers and, therefore, | 
in accordance with energy levels and values of angular momentum. 
A helium atom has two electrons, which may occupy the single 1s 
level. The third electron of lithium must be located at the next 
level, i.e., the 2s level. A beryllium atom has four electrons, which 
occupy the 1s and 2s levels. The fifth electron of boron is at the 2p 
level. At this level, there are six places for electrons, which are all 
filled when neon is reached. But let us postpone consideration of 
the relation of the Mendeleyev periodic law to the electron structure 
of an atom until Sec. 192. 

Do two electrons occupying a level which is characterised by the 
same three quantum numbers differ in any way? It turns out that 
two such electrons differ with respect to the orientation of their 
internal angular momentum (“spin” orientation). 


189. Deflection of an Atomic Beam in a Magnetic Field 


In the preceding articles, the angular momentum of an electron 
due to its motion about a nucleus was discussed in considerable í 
detail. The presence of such angular momentum may be demonstrat- 
ed since an atom acquires a magnetic moment as a result of the 
motion of its electron. 

Let us employ classical concepts and assume that the electron 
revolves in a circle of radius r. Since current is equal to the charge 
transferred in a unit time, the equivalent current of such a revolving 
electron is Z = ne, where n is the number of revolutions per second- 


On the other hand, n =z where v is the velocity. Thus, 


= ve 
~ Qur? 
and the magnetic moment of the revolving electron (see p. 263) iS 


1 1 ve 4 1 
M= S= 5 X tr? =5- eur. 


L, the angular momentum of an electron, is equal to mur. Hence: 


e 
M= aa L. 


This relationship between the angular momentum and the magneti? 
momentum of an electron moving about a nucleus, obtained 


on 
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means of the above simple calculations, has been exprimentally 
confirmed for all atomic electrons. 

Thus, atoms for which Z Æ 0 possess magnetic moments, and it 
may be demonstrated in a suitable experiment that such atoms 
behave like small magnets. Fig. 218 illustrates such an experiment 
in which a parallel beam of atoms passes through a nonuniform 
magnetic field. 

Air is carefully pumped out of a long tube. At the left end, an 
atomic gas is created. Atoms in the left compartment can get out 


Fig. 218 


through small apertures in the screens. Since there are two aper- 
tures, only atoms moving along the axis of the apparatus will remain 
in the beam. This parallel beam of atoms passes through a nonuni- 
form magnetic. field. As was indicated on p. 287, in such a field 
the force acting on a body which has a magnetic moment M is 


aH 


f=M, dy’ 


where a is the field gradient in the direction perpendicular to the 
atomic beam and M, is the projection of the magnetic moment on 
the direction line of the gradient. If the magnetic moment is perpen- 
dicular to the field, no force acts on the body, but if the moment is 
directed along the field, the body is attracted to either the north or 
south pole, depending on the direction in which M is pointed. If 
the atomic beam contains atoms having different magnetic mo- 
ments, or differently oriented magnetic moments, the beam of 
atoms will spread out: atoms travel in different directions and 
different forces act on them. The beam is allowed to impinge on 
a plate until enough atoms hit the plate to be perceptible. 

The above experiment has been of great importance in the devel- 
opment of basic atomic theory. It is still of importance as a method 
of determining the magnetic moments of atomic nuclei. 
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190. Electron Spin 


By means of expe riments with atomic beams, one can measure M 
ind, therefore, L. One of the important deductions of quantum 
mechanics pertains to the quantisation of angular moment L: the 
total momentum L can assume only a discrete set of values, namely, 


b=V 1+)» 


where Z is the azimuthal quantum number. Therefore, the magnetic 
moments of atoms also can assume only a discrete set of values: 


Me ee VTE VN: 


4nmce 


‘The coefficient in this formula, 


7 eh 
H = Zame 


=0.927 x 1072 erg/gauss, 


is called the Bohr magneton” . 

In experiments with atomic beams, the external magnetic field is 
„directed perpendicular to the atomic beam. The projections of L on 
this perpendicular line can also have only a discrete set of values: 


a. 

t 

L,=" > 

hence, y A 
M:=mp, 


where m is the magnetic quantum number. It is seen that the values 
of M, must be equal to a whole number of Bohr magnetons. 

M, may be determined directly from experiments with atomic 
beams. What should be the result of experiments with beams of 
atoms? We may expect the following picture. Hydrogen, helium, 
lithium and beryllium have only s electrons. Since L = 0 for these 
atoms, such a beam does not split up in a magnetic field. When p 
electrons are present, the beam may be expected to split up into 
three distinct components: an undeviated central component for 
m = 0, and two components arranged symmetrically to the right 
and left for m = + 4. For d electrons, we should obtain five beam 
components corresponding to the five possible values of the quantum 
number m, etc. 

These predictions are realised to the following extent: in certain 
cases a beam of atoms does not split up, while in others it splits up 
into distinct components. Thus, it is evident that some atoms have 


* Niels Bohr (1885-1962) noted Danish scientist who developed the first 
theory of atomic structure. 
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no magnetic moment; if an atom has a magnetic moment, it is 
quantised. 

As regards the results obtained for specific atoms, they point to 
a completely new fact, namely, that an electron has an internal 
magnetic moment. : 

Such is the conclusion to be drawn from an experiment with 
a beam of hydrogen atoms: a beam of hydrogen atoms splits up into 
two symmetrical components, which correspond to the deflections of 
atoms with magnetic moments +p. There is no undeviated central 
component! This fact may be explained by the following hypothesis, 
which is supported by other data: an electron has a magnetic moment 
and there are only two possible orientations of this moment in 
space, namely, the projections +p on the direction line of the 
external field. 

The magnetic moment of an electron due to its motion about a 
nucleus is uniquely related, as we have just seen, to the angular 
momentum of the electron motion about the nucleus. It also tran- 
spires that the internal magnetic moment of an electron is related 
to its internal angular momentum, or spin. 

As far back as 1925, before the above experiments which show 
that an electron has an internal magnetic moment were performed, 
Goudsmit and Uhlenbeck suggested that an electron has a spin. 
These physicists showed that such a hypothesis—the existence of 
an electron spin, iie., an internal angular momentum—removes 
insurmountable difficulties in deciphering spectra. At first, it was 
proposed that spin is a consequence of the rotation of an electron 
about its own axis (whence the origin of the term “spin”). But this 
interpretation is incorrect. Electron spin is a primary characteristic 
and is not reducible to something simpler. 

What is the relationship between the internal angular momentum 
(spin) and the internal magnetic moment of an electron? The exper- 
iment with a beam of hydrogen atoms leads to the conclusion that 
M., the projection of the internal magnetic moment of an electron, 
May assume only two values, namely, +u. It may be assumed that 
L., the projection of the spin, may also assume only two values. 


If the formula È NT SN 
P VUE 


is applied to the internal angular momentum of an electron, one 
finds that the number Z has a single value. Thus, as quantum mechan- 
ics indicates, the values Z = 0, 1, 2, . . + correspond to 1, 3, 5, ..., 
and in general (27 + 4) states, respectively. To obtain two spin 
states, which the experimental results show to be the case (22 + 1 = 
= 2), one must assume that l=4. 


32-1409 


al 
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The absolute magnitude of an electron’s internal angular momen- 


Bee 
pe, e the 
ae As for 


projection of spin, assuming as before that the differences in the 
possible values of ZL, must be multiples of = we sce that it can 


tum (spin) has the single possible value L = 


PAS 1 
assume only two values, namely, -+ Spor and — > 


=. Thus, (Lz)sp = 


h 
27 


=6xX uy where ô is a new quantum number (spin number) that 


1 
can assume only two values, namely, +=. 


It was stated earlier that the hydrogen ‘experiment led to deflec- 
tions corresponding to a magnetic moment of one magneton, and 


that M -= p. Since the quantum number ô is equal tot , the relation 
M=- L 
amc 


turns out'to be incorrect for the internal motion of an electron. 
Agreement with experimental results is obtained if 
e 


Msp = Ley. 


2 M 7 a 
Thus, iE? the ratio of magnetic moment to angular momentum for 


electron motion about a nucleus, is one half of the analogous ratio 
for internal motion of an electron. 

The existence of electron spin enables us to formulate the Pauli 
exclusion principle more clearly. There can be no more than two 
electrons in a state having quantum numbers n, l'and m. Such 
electrons differ only with respect to spin projection. Experiments 
show that the spin projections of two such electrons cannot be the 
same. The Pauli exclusion principle may now be formulated as 
follows: There can be only one electron in a state characterised. by 

-four quantum numbers n, l, m and ô. In other words, if there are 


two electrons in an z, l, m state, their spin directions are opposite 
to each other. 


) 


191. Magnetic Moments of Atoms 


The electron spin hypothesis makes it possible to interpret the 
results of experiments with atomic beams. The measurement of 
magnetic moment is one of the most important methods of deter- , 
mining the electron state of atoms. Let us consider the first few 
elements of the Mendeleyey periodic table. 
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We have already studied the hydrogen atom. What happens to 
a beam of helium atoms? Such a beam does not split up. This is as 
it should be. This atom has two electrons in the 2s state. The Pauli 
exclusion principle requires that their spins, and hence their mag- 
netic moments, be oppositely directed; the total magnetic moment 
is equal to zero. 

In a lithium atom, the magnetic moment must be determined by 
the third electron since the actions of the spins of the first two elec- 
irons in the 4s state annul each other. The third lithium electron 
is in the 2s state. Therefore, like in the case of hydrogen, the mag- 
netic moment can be determined only 


by the spin. The splitting up of a lithi- Boron 
um beam into two components shows Se 
that this analysis is correct. Like in m=i 2 
the case of hydrogen, one component = 
3 PSAN 1 = 2 

corresponds to a spin projection aay 
and the other to a spin projection a rey a, + 

A beryllium atom has four elec- ee 
trons—two in the 4s state and two in 2 
the 2s state. Since the spins of the elec- 
trons in each pair are oppositely direct- 3 
ed, they annul each other and yield | __, = ae 
a total moment of zero. Therefore, a east ee 
beam of beryllium atoms does not — 2 
split up into components. Fig. 219 


However, a beam of boron atoms 

splits up into four components. 

How can this be explained? It is evident that the magnetic moment 
is produced only by the fifth electron, which is in the 2p state. 
This state may be realised with three values of the magnetic quan- 
tum number, namely, +14, 0 and —4. Thus, the orbital magnetic 
moment may have three values, including zero. The values of the 
spin magnetic moment must be added to those of the orbital mag- 
netic moment. How is this done? Fig 219 shows all the: possible 
mutual orientations of the moments. It is seen that four combinations 

4 4 


: 7 3 geo WP 
of angular momentum are possible: >, 3; 2 and — -7 (in ae 
units, usually not indicated). 


The splitting up of a beam of carbon atoms is even more complex: 
seven lines, including one which is not deflected, with the following 
values of angular momentum: 3, 2, 1, 0, —1, —2 and —3. 


32* 
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192. The Mendeleyev Periodic Law 


The regularities of the Mendeleyev periodic system become un- 
derstandable when viewed in the light of available dala on the 
electron structure of atoms. ; 

The basic rules of distribution of electrons in an atom according 
to quantum numbers are derived from the Pauli exclusion principle. 
But if we were governed by this information alone, only the electrons 
of the first eighteen atoms (through argon) would be assigned correct 
quantum numbers. This may be explained as follows. Electrons 
fill the energy states of an atom in consecutive order. In a hydrogen 
atom, the energy levels are degenerate, the energy being determined 
by only one quantum number. In multi-electron atoms, degeneracy 
is removed as a result of interaction and the energy of an electron 
in an atom depends on all the quantum numbers. It goes without 
saying that in a hydrogen atom an electron at a level for which 
n = 3 has less energy than an electron at a level for which n = 4. 
This is not necessarily so in multi-electron atoms when the azi- 
muthal quantum number is large. Thus, in a number of cases, the 
“natural” consecutive order of quantum numbers does not correspond 
to the order in which the energy states of an atom are filled. 

It is customary to group the electrons of an atom into shells, 
whereby electrons having the same principal quantum number are 
said to belong to the same shell. The shells are usually designated 
by the following letters: s 

RKs LiM, Ns 9% a 


which correspond to n = 1, 2, 3, 4, ..., respectively. 

The electron cloud of an atom is described in the main by the 
distribution of electrons according to quantum numbers or shells. 
This distribution is represented by formulas which indicate by a 


superscript the number of electrons having the same n and J. For 
example: 


silicon — 4s? 2s? 2p° 3s? 3p; 
calcium — 1s? 2s 2p* 3s? 3p5 452, 


To determine to what extent a shell is complete, one should 
remember that the maximum number of electrons in the subshells 


S ein, Silo 18) Py 10 VAR respectively. These values are 
obtained from the formula 2 (22 + 1). 

Returning to the Mendeleyey table, let us see where the order of 
distribution of electrons according to quantum numbers is violated. 
The first such violation occurs for potassium. The last electron is in 
the 4s level rather than in the 3d level. Calcium, the next element 
in the table, receives another 4s electron. Then, beginning with 
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scandium, the 24st element, the 3d level is built up. But when we 
reach chromium, the 24th element in the table, a new anomaly 
arises. The order of distribution of quantum levels according to 
energies has changed. It becomes unfavourable from the energy 
viewpoint to have two electrons in the 4s level. The configuration 
of chromium is therefore 3s? 3p° 3d° 4s. 

We shall not discuss the remaining anomalies. The electron con- 
figurations for all elements may be found in any physics or chemistry, 
handbook. The main point is the following: the distributions of 
electrons according to quantum numbers, which are explained by 
purely physical methods of investigation (spectral analysis and the 
measurement of magnetic moments), aid us in understanding the 
chemical properties of the various elements. 


193. Ionisation Potentials 


One of the methods used to study the electron configuration of 
an atom is to measure its ionisation energy, i.e., the energy that 
must be expended in removing an electron from the atom. Since 
the energy of an electron in an atom is negative and is reckoned 
from zero (the energy of an electron removed from an atom), the 
ionisation energy is simply equal to the energy level occupied by 
the electron in the atom. It is customary to refer this energy to the 
electron charge and express it in volts. For example, we say that 
the ionisation potential of a hydrogen atom is equal to 13.53 volts. 
This means that to free an electron one must perform work equal 
to that of moving an electron through a potential difference of 
13.53 volts. Fig., 214 shows the significance of this value. s 

In the case of a multi-electron atom, one may find a series of 
ionisation potentials which characterise the levels of the first, 
second, third, etc., electrons, calculating from the position of the 
least bound electron. In this sense, one speaks of the first, second, 
etc., ionisation potential of a given atom.. — — — 5 

There exist many methods of measuring ionisation potentials. 
For this purpose, gases or vapours are placed in an electric field. 
A stream of electrons emitted by a heated filament ionises the gas. 
As long as the energy of a primary electron is insufficient to dislodge 
an atomic electron, the electric current passing through the gas 
does not change. When the energy of the primary electrons becomes 
sufficiently great to dislodge electrons from the atomic gas, a con- 
siderable number of positively charged ions will be present in the 
region and a sharp increase in the electric current occurs. By grad- 
ually increasing the voltage applied to the apparatus, one can 
etermine very accurately the instant when this increase in electric 
Current begins. This critical value of voltage gives the magnitude 
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of the ionisation potential. The values of the first ionisation poten- 
tials of most chemical elements are graphically illustrated in 
Fig. 220. i 

tt is easily seen that the periodicity of this property completely 
conforms with that of the periodic table. It is most difficult to 
dislodge an electron from a helium atom and atoms of the other 


Potential (ev) 


ame 
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{period Atomic No. 
Fig. 220 


noble gases. This is precisely the reason for their chemical inertness. 
The univalent alkali metals have the lowest potentials. This, too, 
is completely understandable to the chemist, who is familiar with 
the exceptional ability of such substances to enter into reactions. 

The values of the first and successive ionisation potentials are 
related to the valency of the atoms. Atoms of the alkali elements 
are univalent because one electron, which is on the outer ring of 
the atoms of these substances, is more weakly bound than the rest 
of the electrons. The first potentials of a caesium atom, for example, 


have the following values: 3.9, 27, 46 and 62 volts. It is seen that 
the differences between the energies necessary to dislodge the first 
and succeeding electrons are quite large. 


194. Atomic Spectra in the Optical Region 

Atomic absorption spectra as well as emission spectra may be 

obtained, but only the latter are of basic importance. Atomic spectra 

of emission in the optical region may be obtained by spectroscople 
investigation of the radiatio 


t n produced by gases and the vapours 
of bodies which are solid at normal temperatures. 
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In order for atoms to radiate, they must be excited, i.e., made to 
pass from a lower energy level to a higher one. When atoms return 
to lower energy levels, an emission spectrum is produced. For every 
transition, there is a corresponding line in the spectrum. 

Atoms may be excited by yarious means. One method consists 
in the use of a gas discharge. The voltage applied to a gas-discharge 
tube accelerates the charged particles in the gas. These particles 
collide with neutral atoms, to which energy is transmitted by impact. 
Another method, which is used in the spectral analysis of metals, 
consists in the creation of arcs or sparks between two electrodes 
made of the material under investigation. Very high temperatures 
are produced in an are or spark, resulting in the vaporisation of the 
substance in the region of the discharge. The atoms are excited 
as the result of collisions. 

An atomic emission spectrum consists of a very large number of 
sharp lines. The radiation frequency corresponding to a given line 
satisfies the equation kvm, = Em —-En. Thus, by measuring the 
frequencies of radiated light, we can determine the differences in 
the energy levels of a given atom. One can reliably interpret atomic 
spectra, i.e., determine energy level patterns, from the values of 
radiation frequencies. Handbooks provide data on the spectral lines 
and energy levels of the chemical elements. 

It should not be supposed that a spectrum contains lines corre- 
sponding to all transitions from any one level to any other. Experi- 
ments have confirmed, and a theoretical basis has been provided 
for, the fact that certain selection rules exist. Certain transitions 
are forbidden, i.e., they do not exist. 

One cannot predict, of course, to which lower energy state an 
excited atom will pass, and what will be the frequency of the radiat- 
ed spectral line. But not all transitions occur with equal proba- 
bility. In principle, the probability of transition from one level 
to another may be theoretically calculated. The magnitude of this 
probability determines, in the main, the intensity of the correspond- 
ing spectral line. > 

Atomic spectra are affected by external fields. If the substance 
under investigation is located in an electric or magnetic field, a 
number of its spectral lines split up into several components. The 
energy of a system having a magnetic moment M and located in an 
external magnetic field H is given by the expression MH (see p. 275). 
States which have the same quantum numbers 7 and J may differ 
from each other with respect to the projection of the magnetic 
moment on the direction line of the magnetic field. Therefore, the 
application of a magnetic field removes the degeneracy of energy 
levels and atomic electrons having different magnetic quantum 
numbers will have different energies. 
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Investigations of atomic spectra of emission in the oplical region 
are of great practical importance. Such a method of spectral anal- 
ysis constitutes a very sensitive means of determining the chemical 
composition (up to 10-19 gm) of substances, primarily alloys, and 
in a number of cases is more sensitive than chemical analysis. 

Optical frequencies usually arise with relatively weak excitation 
of an atom, i.e., when outer, valence electrons are transferred to 
a higher level. But even a very “high” electron can produce a broad 
spectrum. It would appear that the radiation frequency has no lower 
limit. Thus, the energy level diagram shows that as z increases the 
levels come closer together (Fig. 214 shows the levels and transi- 
tions for hydrogen, but in principle the patterns are the same for 
other atoms). This means that transitions corresponding to very 
low frequencies (long wavelengths) occur. However, experiments 
show that spectra produced by outer electrons, even though they 
extend into the infrared region, do not include lines of very long 
wavelength. It must be concluded that the probability of a transi- 
tion to some energy level such as the 21st is not large, and the proba- 
bility of a transition from the 21st to the 20th, for which a photon 
of low v would be radiated, is quite negligible. 

In the direction of high frequencies (short wavelengths), the fre- 
quency is limited by the ionisation potential. With respect to the 
“highest” electron, the potential of helium is the greatest and that 
of caesium the lowest, viz., 24v and áv, respectively. This, corre- 
sponds to radiation, frequencies of 6 X 10" cps (4 = 500 A) and 
10" eps (A = 3,000 A), respectively. Thus, only a high-level electron 
can bring us into the region of very short ultraviolet wavelengths, 
which, relative to characteristic X-ray radiation, may also be called 
a region of very long wavelengths, 

It is quite understandable that electrons of inner shells can be 
raised to high levels with strong excitation. In such a case, the 


characteristic spectrum includes X-rays. 
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In multi-electron atoms, the ionisation potentials of low levels 
reach high values, The excitation of such atoms may, therefore, 
result in the radiation of X-rays (wavelengths of the order of 0.1-10 A). 
An energy of the order of 104 ey must be imparted to an atom in 
order to produce X-ray radiation, This may be achieved in gas- 
discharge tubes by applying a voltage of tens of thousands of volts. 

One may calculate the value of the temperature at which an atom 
begins to radiate X-rays due to thermal collisions with other atoms. 
If the average kinetic energy per degree of freedom is to be of the 
order of 104 ev, the temperature must be of the order of 108 °K. 
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Such high temperatures are achieved in solar and celestial atomic 
explosions (see p. 582). The X-ray radiation of the Sun may be 
determined by means of instruments placed in artificial Earth 
satellites. 

A practical method of obtaining X-rays is by the bombardment 
of a solid (the anti-cathode of an X-ray tube) with a stream of elec- 
trons. Electrons impinging on the anti-cathode are abruptly braked 
As a result, a continuous spectrum of X-rays is obtained. The elec- 
tron energy, which has been increased to a value 61 by acceleration 
in an electric field, decreases to a value G2 as the result of braking. 
The energy difference @, — Ê: = hv- is released in the form of 
radiation. G2 may assume any value from @, to zero. Hence the 


radiation frequencies lie between the limits —— and zero. The 


electron energy which does not go into radiation is transformed into 

heat. Only about one-hundredth of the energy of the electron beam 

is transformed into X-ray energy. Evidently, the continuous X-ray 
c he 

Substi- 


spectrum has a short-wavelength limit Amin = 
tuting the values of the constants, we obtain 
12.3 


Amin = Van 


Vmax eV 


where A is expressed in Angstroms and V in kilovolts. Beginning at 
a very definite wavelength, the continuous X-ray spectrum increases 
in intensity with increasing wavelength, reaches a maximum several 
score Angstroms from the short-wavelength limit, and then slowly 
decreases in intensity. 

Investigations show that sharp lines, having a characteristic form 
for every element, are superimposed on the continuous spectrum. 
A characteristic X-ray spectrum arises owing to the fact that some of 
the electrons which impinge on the anti-cathode penetrate the atoms 
and dislodge inner electrons, i.e., electrons in the K, L, etc., shells. 
An X-ray quantum is produced when a high-level electron passes 
over to a vacated low-level position. The set of spectral lines due to 
electron transitions to the K level is called the K series, to the Z 
level the Z series, etc. If the voltage applied to an X-ray tube is 
increased, the series will appear in consecutive order because, as 
the energy of the electrons impinging on the anti-cathode is increased, 
more and more low-energy levels will be consecutively vacated and 
made available for transitions. The K series will be the last to appear. 

The general scheme of electron X-ray transitions is shown in 
Fig. 221, where heavy dots indicate initial levels: The most intense 
lines are marked on the diagram. However, some transitions are 
missing since they are forbidden by the selection rules. This pertains, 
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for example, to transitions with the same value of azimuthal quan- 
tum number. 

Since the configuration of completed lower shells is the same for 
all atoms, X-ray spectra diagrams of different atoms are very similar 
to each other. All spectra contain typical sequences of lines, which 
are systematically shifted along the wavelength scale in accordance 
with the atomic number of the elements. For example, all elements 
produce a strong « doublet (Ka, and Kæ») and a weaker B doublet. 
Quite often these doublets are unresolved. In such a case, one speaks 
of the œ line and f line of a given element’s K series. These doublets 
are of a “spin” nature. 

Fig. 222 shows that a characteristic spectrum is gradually shifted 
into the short-wavelength region with increasing atomic number of 
the element giving rise to this spectrum. This law was discovered 
by Moseley. Its physical basis is the steady increase of the inter- 
action force between an electron and a nucleus with increasing 
nuclear charge. The formula expressing this law will not be present- 
ed, but the systematic displacement of the lines is clearly evident 


from the figure. 


CHAPTER XXIX 


MOLECULES 


196. Chemical Bonds 


A molecule is a stable configuration of atoms. Every atom in a 
molecule occupies a stable position. The displacement of an atom in 
any direction results in an increase in the potential energy of the 
molecule. When an atom approaches a neighbouring atom there is 

a force of repulsion, and when it recedes a force 
v of attraction. Every atom of a molecule, and the 
molecule as a whole, is in a potential well. 

The form of the potential curve of an atom 
or molecule is quite evident (see Fig. 223). Since 
, it is not possible to reduce the distance between 
—___* atoms to zero, the curve of potential energy as 

a function of their separation rises sharply as 
this distance decreases. In the direction of 
increasing separation, the curve rises from the 
equilibrium position, i.e., the bottom of the 
4—— ~ well, much less sharply. Variations are possible: 
U The potential energy at great distances may be 
more or less than at the bottom of the well and 
the well may have or may not have a clearly 
defined wall. The energy of a molecule may be 
4 more or less than the sum of the energies of its 

Fig. 223 atoms. Accordingly, when atoms are combined in 
a molecule, heat is either released or absorbed 
(see p. 568). 
An atom in a potential well is bound to its neighbours. What is 
the reason for this bond? Do various types of bonds exist? Jonic 
and homopolar bonds are two ideal classifications of chemical bonds. 
In the overwhelming majority of cases of interest in chemistry, one 
of these two types of bonds, or an intermediate case in which both 
ideal types coexist, occurs. 

If an atom can transfer one or several electrons to another atom 
electrostatic attraction will occur between the ions which are formed. 
This is what is meant by an ionic bond. At a certain interatomic 
separation which is characteristic of this pair of ions, the forces of 
electrostatic attraction are counterbalanced by the repulsion of the 
electron clouds of the atoms. 
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If an atom is to transfer an electron to another atom, it is neces- 
sary that this process be advantageous from the energy viewpoint. 
In such a case, the simple tendency to pass over to the lowest energy 
level will result in the transfer of an electron. = 

It was shown on p. 488 that an electron can be torn away from 
a neutral atom by an expenditure of energy equal to the product of 
an electron charge and the ionisation potential of the atom. Thus, 
the formation of a positive ion is always associated with an expendi- 
ture of work. On the other hand, the formation of a negative ion, 
i.e., the attachment of an electron to a neutral atom, is associated 
with a release of energy. To be sure, this applies only to the first 
electron. The attachment of a second electron to a singly charged 
negative atomic ion requires an expenditure of work to overcome 
the electrostatic repulsion. 

An ionic bond can exist if the energy of tearing away an electron, 
i.e., the work of creating a positive ion, is less than the sum of the 
energy released in the formation of negative ions and the energy 
due to electrostatic attraction between the ions. 

The alkali metals, in which the last electron is just beginning to 
form a new shell, have the lowest ionisation potentials. In the 
alkaline earth metals, each atom has two loosely bound electrons. 
It-is evident that the formation of a positive ion from a neutral 


atom requires the least work when the electrons to be torn away 


are just beginning to form a new shell. 
On the other hand, it turns out that the most energy is released 


when an electron becomes attached to a halogen atom, in which the 
outer shell is one electron short of being complete. Therefore, in a 
large number of cases, an ionic bond is formed when a transfer of 
electrons resulting in the creation (in the formed ions) of closed 
electron shells, characteristic of atoms of the noble gases, occurs. 
In this way, the physical significance of potential wells in such 
molecules as NaCl and MgCl, may be easily explained. 

However, this explanation is not valid in all cases. For example, 
diatomic molecules of hydrogen, oxygen, etc., are not covered by 
this explanation. It cannot be assumed that in uniting one of these 
atoms is transformed into a negative ion and the other into a positive 
ion. Theoretical arguments need not he mustered. The physical 
properties of molecules formed of ions indicate whether an ionic 
bond exists or not. Specifically, ionic compounds dissociate and 
form electrolytes. A large class of organic molecules do not behave 
in this manner. Therefore, for such substances, an ionic model is 


clearly not applicable. ; 
How can one explain the bond between atoms of such molecules? 
We must determine whether or not a gain In energy occurs when, 


say, two hydrogen atoms unite to form a molecule. 
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Such a gain does take place, and the conditions for its occurrence 
are given by quantum mechanics. As was stated on p. 479 the electron 
of a hydrogen atom behaves, in the main, like an electron in a po- 
tential well. The zero energy level of an electron in a potential well 
is determined by the dimensions of the well (see p. 480), i.e., the 
smaller its dimensions the greater the zero-point energy. Thus, any 
expansion of the region in which the electron could move results 
in a decrease in energy. 

Now, imagine that two hydrogen atoms, which have one electron 
each, come into contact with each other. Since the Pauli exclusion 
principle allows two electrons to be in one state, the regions in 
which the electrons exist may merge and create a potential well of 
increased dimensions. This can occur only for two electrons having 
opposite spins. 

If a third atom approaches the hydrogen molecule which has 
formed, the argumentation used above is no longer applicable. The 
third electron cannot merge its region of motion with that of the 
electrons in the hydrogen molecule since this is not allowed by the 
Pauli exclusion principle: the vacant sites of the hydrogen molecule 
are occupied by two electrons of opposite spin. 

Thus, the second type of bond, a so-called homopolar bond, is 
provided by a_pair of electrons of opposite spin. In the case of an 
ionic bond there is a transfer of electrons from one atom to another, 
but here the bond is achieved by joint action of the electrons, i.e., 
it is as if a common region of motion has been created. An expansion 

- of the region in which an electron may move results in a decrease 
in energy and this is the reason for the formation of a potential 
well. This bond—the merging of the electronic clouds of electrons 
aonne opposite spins—is the main type of bond in organic mole- 
cules. 

Each atom is capable of forming a limited number of homopolar 
bonds. Two electrons of opposite spin, having a common “living 
space” in the form of the overlapping clouds of their wave functions, 
take part in the creation of each bond. i 

As we know, s electrons have spherically symmetrical p-func- 
tions, but the wp-functions of P, d and f electrons extend in specific 
directions. Therefore, a homopolar bond between any two electrons, 
except s electrons, will be a directed bond. If a bond has formed 
between two atoms, the electronic clouds of these atoms assume a 
definite orientation relative to the first bond line. Thus, only cer- 
tain specific angles are formed between bond lines emanating from 
these atoms. The values of such normal bond angles may be derived 
from quantum mechanics for all atoms. 

To a certain extent, both types of bonds are ideal. W 


s J e frequently’ 
encounter cases in which the physical and chemical 


properties of 
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a molecule make it necessary to adopt an intermediate bond mecha- 
nism. In an ionic bond an electron is completely transferred from 
one atom to another and in a homopolar bond each electron belongs 
equally to both bound atoms, but in intermediate cases the electrons 
implementing a bond may spend more time near one of the atoms 
than the other. Such a model reflects, for example, the existence 
of an ionic bond in which the bond electrons belong most of the 
time to a negative ion and the existence of a homopolar bond in 
which the bond electrons spend almost the same amount of time 
with cach of the bound atoms. Intermediate bonds of any percent- 
age of “ionocity” are possible. 
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A vast amount of data on spacings between the centres of atoms 
in molecules and crystals has been accumulated. Most of this data 
has been obtained by diffraction methods. If we do not insist on 
a very high degree of accuracy, it turns out that it is possible to 
represent molecules by models which give the shape and dimensions 
of the molecules. 

Models of molecules of the NaCl type, in which the atoms are 
joined by an ionic bond, are particularly simple. Each ion may be 
represented by a sphere having a definite radius. The dimensions 
of a number of ions are given in the following table: 


Ion Lit Nat Kt Cs+ p cl- Br- TE 


2.16 


1.33 | 1.69 | 1.36 1.81 | 1.95 


Ton radius, A | 0.60 | 0.95 


By means of such a table, we can determine the spacing between 
the centres of ions in any salt. For example, in NaCl it is equal to 
0.95 + 4.81 = 2.76 A. 

But what is the significance of the ion may bi 
represented by a sphere? To show that such a representation is justi- 
fied, we must determine how closely to a molecule (say NaCl) another 
ion (sodium or chlorine) may approach. This is possible since exper- 
iments indicate that both fused and solid salts consist of ions. It 
turns out that a second and a third ion approach a given ion just 
as closely as the first. Moreover, ions the charges of which have the 
Same sign may also approach each other to a distance equal to the 
sum of the ion radii. Thus, ions behave like spheres. 

An important conclusion regarding ionic molecules may be drawn 
from these geometric facts. Let us assume that a group of molecules 


assertion that an ion may be 
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are gathered closely around one of the molecules. The arrangement 
of ions is shown in Fig. 224. Since complete equalily exists in inter- 
atomic spacings, it-is no longer possible to say with which chlo- 
rine neighbour of a given sodium ion, or with which sodium neigh- 
bour of a given chlorine ion, a molecule is formed. The concept 
of a molecule has lost its meaning. 

We must conclude from the geometric arrangement of atom centres 
that in a concentrated state, i.e.; in a liquid or solid, where the 
atoms are linked by ionic bonds, molecules do not exist as distinct 
formations. The concept of a molecule turns out to be inapplicable. 

But what ‘is the situation in the case of a gas? Upon vaporisation, 
a pair of ions of opposite charge, the net electric charge of which 
is equal to zero, is most easily torn 
away from a liquid. Therefore, basi- 
cally, molecules of the NaCl type are 
to be found in vapours. However, in 
addition to molecules, ions are also 
vaporised from a liquid. 

The situation is entirely different in 
the case of molecules having homo- 
polar, i.e., covalent, bonds. An anal- 
ysis of interatomic spacings encoun- 
tered in molecules shows that spacings 
between atom centres may be calcu- 
lated by means of so-called atomic 
radii. The values of such radii in Angstroms for the most often 
encountered atoms are as follows: 


c— C= C= H `O O= o= 
0 0. 


0.771 0.665 0.602 0.30 0.66 .55 50 


Atomic radii decrease with increasing multiplicity of the valent 
bond. The table shows that the separation between two bound carbon 
atoms, C—C, is 1.54 A; the separation in C—H is 1.07 A, etc. 

In constructing a model of a molecule, we also have at our dispos- 
al certain elementary data on bond angles. The existence of normal 
bond angles is understandable from considerations of symmetry and 
is in agreement with certain qualitative quantum mechanical reason- 
ing discussed in the preceding article. Thus, the normal bond 
angle of a carbon atom that is linked to four atoms is a tetrahedral 
angle (109°28’). In the case of an aromatic carbon atom, as well as 
other carbon atoms that are linked to three atoms, the normal bond 
angle is equal to 120°. Finally, the characteristic bond angle of 
a carbon atom that is linked to two atoms is 180°. 

The normal bond angles of oxygen, sulphur and nitrogen atoms 
which are linked to two atoms in the case of oxygen and sulphur 
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and to three atoms in the case of nitrogen are equal to 90°. The nitro- 
gen atom in the nitro group NO2 has a normal bond angle of 120°. 

In a number of cases, bond angles deviate considerably from the 
“normal”. In certain cyclic compounds of the cyclobutane type, the 
angles are equal to 90° rather than 109°28’. Such deviations are 
due to spatial obstacles. However, before discussing this, we must 
clarify a third geometric characteristic of a molecule, namely, its 
intermolecular radius. 

Investigations of molecular arrangements in crystals have shown 
that each atom may be assigned an intermolecular radius, such that 


Fig. 225 


on the average opislbomane Oe will touch each other, Thus, 
for example, the intermolecular radiyis of hydrogen is 1.17 A, oxy- 
gen—1.36 A, nitrogen—1.57 A, etc. This does not mean, however, 
that the distances between atoms o% the same molecule which are 
not linked by valent bonds are detersiaed by these values. The 
dimensions and the form of a molecule are determined by inter- 
action between the forces establishing equilibrium distances between 
atoms which are not linked by valent bonds and the forces establish- 
ing normal bond angles. Since the bond forces between atoms are 
an order of magnitude greater than the other forces, the interatomic 


distances do not change and the configuration of the molecule is 


determi tition between the elasticity of a bond angle 
a ar sphere of an atom. 


and tl ressibility of the intermolecul e 
Hare. Raat bat graphic example. Experiments ny that 
the bond angle in a molecule of water 1s equal to 105°. me istance 
between hydrogen atoms is 4.54 A. Therefore, considerable compres- 
sion of the intermolecular spheres of the hydrogen Seg Ce 
his compression (2 X 1.17—1.54 = 0.8 Å) is balanced by the 


33-1409 
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elasticity of the bond angle, the normal value of which is equal 
to 90°. Thus, the forces which compress the hydrogen atoms by 0.8 A 
are equal to the forces which change the angle from 90° to 105°. 
Such a simple mechanism explains the difference between the struc- 
ture of a hydrogen sulphide molecule and that of a water molecule. 
Since the length of the hydrogen-sulphur bond is considerably 
longer than that of the hydrogen-oxygen bond, the hydrogen atoms 
of the former molecule are considerably less “crowded”. It turns out 
that in hydrogen sulphide the distance between hydrogen atoms is 
equal to 1.99 A and the bond angle is equal to 92°. The compression 
of hydrogen atoms by 0.35 A is balanced by a change of only 2° 
in the bond angle. Many organic molecules may be used to illustrate 
- the validity of this mechanism. 

Fig. 225 shows the structure of a chlorobenzene molecule. To the 

left is a parbon atom; in the centre, the beginning of the build up, 


viz., a C group; and to the right, a model of the molecule. 
0” Sc 


198. The Electronic Cloud of a Molecule 


Electron motion in a molecule, just as in an atom, is described 
by a wave function. Strictly speaking, the w-function is a function 
of 3n coordinates, where n is the number of electrons in the molecule. 
Then, wp? will give the probability of any electron distribution, 
i.e., the “electron density”. 

It has been noted already that the solution of the Schrédinger 
equation for multi-electron atoms is very complex. In the case of 
molecules, the difficulties are, of course, even greater. Only approx- 
imate, semiempirical methods of calculation are applicable here. 
In this connection, physical methods of determining electron den- 
sity are of particular importance. However, even when such methods 
are used, the results are quite limited. 

The time-averaged electron density of a molecule is determined 
by means of X-ray structural analysis (the electron density gives 
the probability that the electrons are at a given location). As a 
result of the vibrations of atoms inside a molecule, and of the mole- 
cule as a whole, a photograph of an electronic cloud is smeared. 
Fig. 167 (p. 388) shows a cross-section of the electron density pattern 
of an anthracene molecule. The coarseness of the method may be 
gauged from the fact that the hydrogen atoms of the molecule are 
not apparent in all cases. The method used in plotting the pattern 
is similar to that used in the construction of topographical maps- 
Electron peaks and valleys are indicated by lines connecting points 
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with the same electron density. Each atom is represented by an 
electron density “hill”. Superposition of the bell-shaped density 
functions of two atoms along a bond line results in the formation 
of a “bridge” between the atoms. Unfortunately, the accuracy of the 
method is too poor to enable us to determine the nature of the chemi- 
cal bond by measuring the height of this bridge. Its height is indis- 
tinguishable from the sum of the density functions of two free 
atoms. But the specific nature of the chemical bond should probably 
be manifested in an additional increase in electron density (as com- 
pared with free electrons). Such electronic cloud patterns are, there- 
fore, merely interesting illustrations of molecular structure. 

If the electron density with respect to the atomic nuclei of a mole- 
cule were known, we would be able to calculate the dipole moment 
of the molecule. For this purpose, it would be necessary to determine 
the centres of “gravity” of the positive and negative charges. The 
dipole moment has not yet been determined in this manner, although 
comparison of neutronographic (neutrons scattered on nuclei) and 
roentgenographic data could be used to solve such a problem. How- 
ever, the dipole moment of a molecule can be reliably measured 
(see p. 656) and it is then possible to solve the converse problem, 
namely, determine the centre of “gravity” of negative charge by 
means of the dipole moment. 

It would seem that in purely ionic molecules we encounter the 
extreme case in which the centre of gravity of an electronic cloud 
coincides with the centre of an anion. The dipole moment of KCl, 
for example, could then be predicted in the following manner. If one 
electron is taken from a potassium atom and transferred to a chlo- 
rine atom, one “extra” positive charge will be separated from one 
“extra” negative charge by the distance between the potassium and 
chlorine centres, i.e., 1.81 + 1.33 = 3.14 A. Hence, the dipole 
moment will be equal to 3.14 x 4.8 X 10718 = 15 Gaussian units. 
But experiments yield a value of 6.8 Gaussian units. This means 
that even in the case of such a classical ionic bond the potassium 
electron does not go over completely to the anion. On the other 
hand, the other extreme case is fully realised. Evidently, symmetri- 
cal molecules such as Hə, Oz and benzene cannot have a dipole 
moment: the centres of gravity of the electronic cloud and the nuclei 


coinci 
ae property of an electronic cloud should be mentioned, 
namely, its ability to be displaced relative to a nucleus. Electronic 
clouds may be displaced relative to nuclei by means of an electric 
heavier than electrons, it may be as- 


field. Si nuclei are much i 
E eromealel remain fixed. The displacement of the electronic 
cloud of a molecule may be described by the displacement of its 


centre of gravity. When the centre of gravity of negative charge is 
33* 
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displaced relative to the centre of gravity of positive charge by 
a distance z, the molecule acquires an induced dipole moment 
p = Nex, where N is the number of electrons in the molecule. The 


"ou 
>N UW 


Fig. 226 


induced dipole moment in- 
creases linearly with the field, 
i. e., p = BE. It is customary 
to describe the displacement of 
the centre of gravity of an 
electronic cloud by f, the mag- 
nitude of the polarisability of 
the molecule. The quantity P 
has the dimensions of volume. 
The greater the volume of the 
molecule, the greater the value 
of B (see Chapter XXXV). 


199. Energy Levels of 
Molecules 


The energy of an atom 
changes only by one means: a 
change occurs in its electron 
motion, i.e., an electron passes 
into another quantum state. 
The energy of a molecule 
may also change in this man- 
ner, but by other means as 
well. For example, the atoms 
of a molecule vibrate relative 
to one another. The vibrational 
energy is an integral part of 
the energy of a molecule and 
also may assume only a dis- 
crete set of values. Furthermore, 
a molecule rotates as a 
whole. The rotational energy is 
also quantised and a change in 
the state of a molecule may 
result in a change in rotational 
energy. Therefore, the energy 


state of a molecule is described by indicating the state of its elec- 
tronic cloud (electron level), the state of its vibrational motion 
(vibrational level) and the state of its rotational motion (rotational 
level). We deal with three kinds of information — analogous, so to 
speak, to a house number, floor number and apartment number. 
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But which is analogous to a floor number and which to an apart- 
ment number? Which energy levels are separated by large intervals 
and which by small ones? The answers to these questions are con- 
tained in the energy level diagram shown in Fig. 226, which is 
based on experimental results and theory. Two electronic levels, 
e’ and e”, are shown in this figure. Associated with each electronic 
level is a group of vibrational levels designated by a set of v values, 
and associated with each vibrational level is a group of rotational 
levels designated by a set of j values. 

Clearly, the intervals between rotational levels are less than be- 
tween vibrational levels, and those between vibrational levels are 
less than between electronic levels. 

Let us assume that a molecule may have electronic levels at 100, 
200, 300, ... energy units, vibrational levels at 10, 20, 30, ... 
units, and rotational levels at: 1, 2,3, ... units. In such a case, 
a molecule at the second electronic level, first vibrational level 
and third rotational level will have an energy of 213 units. 

Thus, the energy of a molecule may be given in the form 

W = Wer+Woiv+ Wrot- 
The frequency of radiated or absorbed light may always be deter- 
mined from the difference in energy between two levels, i.e., 


y= (AWer + AW ev + AW rot)- 


It would be interesting to examine transitions involving a change 
in only one “kind” of energy. Practically, this is possible only for 
rotational transitions and it is easily seen why this is so. 

Let us investigate the absorption of electromagnetic waves by 
a group of molecules. Beginning with the longest wavelengths, i.e., 
smallest packets of energy hv, we find that the molecules do not 
absorb energy as long as the magnitude of a quantum of energy is 
less than the difference between two neighbouring levels. By grad- 
ually increasing the frequency, we eventually obtain quanta of 
energy that are capable of raising molecules from one rotational 
level to another. Experiments show that this occurs in the micro- 
wave region (at the end of the radio band) or, in other words, in the 
far infrared spectrum. It is found that wavelengths of the order of 
0.4-1 mm are absorbed by molecules. Thus, a pure rotational spec- 


trum may be obtained. 

By further increasing 
Spectrum to become more dev 
the quanta of energy imping! 
high frequency to make molecules pass 
another. It is clear, however, that a pure 
a series of transitions for which the num 


the frequency, We enable the rotational 
eloped, but nothing new occurs until 
g on the substance are of sufficiently 
from one vibrational level to 
vibrational spectrum, i.e., 
ber of the rotational level 


ie. 
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does not change, is never obtained. Transitions from one vibrational 
level to another involve various rotational levels. For example, 
a transition from the zero (lowest) vibrational level to the first may 
be accomplished by molecules from the fourth rotational level to 
the third, the third to the second, etc. Thus, there arises a vibration- 
rotational spectrum, which may be observed in infrared light (3- 
50 u). Clearly, all transitions from one vibrational level to another 
are close to one another and yield a group of very close lines in the 
spectrum. For low resolution, these lines merge into one band. 
Each band corresponds to a definite vibrational transition. 

By increasing the frequency still further, we finally reach a new 
spectral region, which is characteristic of a molecule. This occurs 
in the optical and ultraviolet portion of the spectrum where the 
energy of a quantum suffices for the transition of a molecule from 
one electronic level to another. Here, of course, neither pure electron- 
ic transitions nor pure electronic-vibrational transitions are possi- 
ble. Electronic-rotational transitions, involving a change in “house”, 
“floor” and “apartment”, occur. Since a vibration-rotational transi- 
tion gives rise to a band, the spectrum in the optical region is 
“striped”, i.e., it consists of a system of bands. 


Now, let us discuss the various types of molecular spectra in 
detail. 


200. The Rotational Spectrum of Molecules 


Free rotation of molecules occurs only in the gas state. Therefore, 
basic data on rotational energy levels are obtained by studying gas 
spectra. Investigation of these spectra by optical means is very dif- 
ficult. Much more suitable for this purpose is a radio-spectroscopic 
procedure that has been developed during recent years. A generator 
of electromagnetic waves transmits radiation through a waveguide* 
which is partially filled with the gas under investigation. After pass- 
ing through the gas, the electromagnetic waves arrive at a receiver 
which measures their intensity. This measurement may be performed 
over a large range of frequencies. The width of the band of frequen- 
cies generated by radio methods may be made so narrow that the 
resolving power becomes hundreds of thousands of times (!) greater 
than in the case of optical methods. Optical methods enable us to 
distinguish lines separated by 0.4 cmt, but by radio methods we can 
distinguish lines separated hy 10-8 cm-1**, By means of this high 


* A waveguide is a metallic duct of rectangular or circular cross-section 
through which centimetre radio 


waves may be propagated with practically 
no losses. 


** In spectroscopy, in addition to wa 
use a reciprocal wavelength unit (w: 
per centimetre, 


velength units, it is customary to 
ave number), i.e., the number of waves 
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resolving power, we are able to solve a number of interesting prob- 
lems which are discussed below. A rotational spectrum arises as 
a result of the quantisation of a molecule’s kinetic energy of rotation: 
Io? 


Ja? 


Kyot = 


where J is the moment of inertia of the molecule. This is the form of 
the expression for the energy of a diatomic molecule. This energy is 
described by a single moment of inertia taken about an axis perpen- 
dicular to the line joining the atoms and which passes through the 
centre of inertia. As was indicated earlier (p. 77), in the general case 
the rotation is described by three moments of inertia taken about 
three main axes. 

Briefly, let us consider the rotational spectra of diatomic molecules. 

First, it should be emphasised that not all molecules, including 
diatomic molecules, will yield a rotational spectrum of radiation or 
absorption. As has been explained already (see p. 322), every radia- 
tor or absorber of electromagnetic waves is a kind of oscillator, i.e., 
an elementary dipole. If the atomic motion of a molecule or the mo- 
tion of a molecule as a whole is not accompanied by a change in 
dipole moment, such motion cannot result in the radiation or absorp- 
tion of electromagnetic waves. 

When a molecule radiates or absorbs energy, its dipole moment p 
varies periodically as the oscillation frequency. The dipole moment 
oscillates about an average value, corresponding to the equilibrium 
position of the atoms. It may be shown that the intensity of the spec- ` 
tral lines is proportional to the derivative (E) i.e., the maxi- 
mum rate of change of the dipole moment with respect to interatom- 
ic spacing. All symmetrical molecules the atoms of which are 
joined by homopolar bonds have a constant zero value of p. There- 
fore, they do not give rise to rotational spectra. Such molecules in- 
clude, for example, all diatomic molecules of the same atoms 
(Hz, Oo, Na, ete:). 

Let us consider the rotational spectrum of a diatomic polar mole- 
cule, i.e., a molecule possessing a dipole moment. The rotational 


2 . 
energy of such a molecule is Kyo, = E, here w is the angular ve- 
locity of rotation and J the molecule’s moment of inertia: 


mm2 19 
mpm * 


T=myri + mars = 


where r; and rs are the distances to the centre of inertia and r = ry + 
-+ ry. The value of œ is determined from the fact that according to 
a rule of quantum mechanics (p. 490) the rotational momentum, 
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Im, may assume only the discrete set of values 
h ci Kae er Icy 
5 Vi GFL; 


where j = 0, 4, 2, ... is the quantum number designating the rota- 
tional levels. Therefore, the angular velocities of rotation of a mole- 
cule may assume only the following set of values: 
h Sie 
oj=55VIG+D; 
hence, 


2 REE 
Kr =e = i OGHA). 


Beginning with a zero energy of rotation, the energy of successive 
levels increases in accordance with a square law. 
Energy transitions are subject to a simple selection rule, i.e., 
only transitions between neighbouring levels are allowed (Fig. 227). 
The radiation or absorption frequency 
in the rotational spectrum of a diatomic 
molecule is given by 


hj 5 
va (i= 0,1, 2, ...) 


for a transition between the j and j — 4 
levels. In this simple case, the rotational 
spectrum consists of a system of equally 
spaced lines. »” 

For different gas temperatures, the 
average energy of rotation of a molecule 
differs. In accordance with Boltzmann’s 

Io? 


law, the most probable energy is given by —=-=A4kT (two rota- 


tional degrees of freedom—see p. 195). Thus, the number of 
the energy level at which a molecule is most frequently located may 
be easily calculated. For-example, in the case of a molecule of a 
hydrochloric acid vapour (Z = 2.64 x 10-40 gm cm?*), at tempera- 
tures of 300°, 600° and 1,200°K, we obtain j = 4, 6 and 8, respec- 
tively. 

Since transitions are possible only between neighbouring levels, 
a series of equally spaced frequencies will be grouped about the line 
of “average” j-value. The line intensity decreases as its distance from 
this j-value increases, since the number of molecules in the corre- 
sponding energy state decreases. 

Rotational spectra enable us to determine interatomic distances 
in simple molecules to a very high degree of accuracy (much greater 


Fig. 227 
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accuracy than by diffraction methods). Thus, if the number of atoms 
in a molecule is not large, the distances between atoms may be deter- 
mined if the moment of inertia and the masses of the atoms are 
known. For a diatomic molecule, 


Jb mm: 
r=YV => Where m= — 2 
m my+mys 


In the case of a hydrochloric acid molecule: 
my = 1.67 10-24 gm; mg, =35X1.67X10-24 gm. 
The separation between the H and Cl atoms in an HCl molecule is 


y 2.61 10-40 X36 X 1.67 10-24 
pie 35% 1.67 10-48 


1.63x10-8 cm. 


This value agrees closely with values obtained by other means. 
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This type of spectrum may be observed in a wavelength band 
extending from 2-3 to several score microns. For brevity, the vibra- 
tion-rotational absorption spectrum is referred to as the “infrared 
spectrum”. In the case of solids, where there is no molecular rotation, 
a pure vibrational spectrum is obtained. In the case of liquids, where 
rotation is impeded, the rotational structure of the band is smeared. 
. Diatomie Molecules. Let us disregard rotation for the present 
and consider vibrational energy levels. 

The vibration of a diatomic molecule may be visualised by means 
of a simple model consisting of two spheres joined by a spring. In 
such a system, the natural frequency of oscillations is given by 


2m m 


where i: is the stiffness coefficient determining the binding force and 
mis the mass of an atom when the atoms in the molecule are the same; 
when the masses differ, m is the reduced mass, which is equal to 


y (we leave the proof of this to the reader). Quantum mechanics 
ows 1 the energy of an oscillator is given by the formula 


shows that 
1 
é= (+3) hy. 


Here, — py is the zero-point energy of the oscillator, i.e., the oscilla- 
2 3D. 


tion energy at absolute zero, and v = 0, 4, 2, ... is the oscillation 
quantum number. Moreover, it is shown in quantum mechanics. 
that in the case of harmonic oscillators energy transitions may 


ba 
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occur only between neighbouring levels. In the case of nonharmonic 
oscillators transitions skipping one level or more occur, but these 
are weaker than the main transitions. Harmonic oscillations occur 
under the action of a réstoring force—kz. The potential energy of 
such oscillations is ‘2 , i.e., the shape of the curve is parabolic. 
Fig. 228 shows a potential curve (and an inscribed parabola) for 
a diatomic molecule. The horizontal lines represent energy levels 
based on theoretical calculations. 
For low values of energy, the devi- 
ation of the potential curve from 
a parabola is negligible. Such 
a molecule may be expected to 
obey the harmonic oscillator law 
as long as the vibrational energy 
is much less than the dissociation 
energy of the molecule. Under such 
conditions, the vibrational levels 
may be considered to be equally 
spaced, and since only transitions 
between neighbouring levels are 


allowed, the diatomic molecule 
will possess a single transition frequency. If there is no molecular ro- 


tation, the entire spectrum will consist of a single line. Actually, in 
addition to the main frequency v, the spectrum contains the “over- 
tone” frequencies 2v, 3v, etc. (as the separation between levels 
decreases, the proportional trend of the overtone frequencies is 
lost). However, the overtones are weak and in very many cases we 
have a right to speak of a single vibration frequency. 

The presence of molecular rotation will transform such a spectral 
line into a band. If a molecule vibrates and rotates simultaneously, 
its energy is determined by the two quantum numbers v and j: 


e= (v+) hvo t ii +4). 


The frequencies obtained now fall into two groups, one less than 
and one more than the vibration frequency vp. These groups are 
known as branches and are designated by the letters R and P. 


Taking into account the selection rules discussed above, we obtain 
the following frequency formula: 


Fig. 228 


aay, 
V= Voto E Gayl E 2s, 


The plus sign corresponds to transitions to higher rotational levels 
and the minus sign to lower rotational levels. 
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This is shown in Fig. 229, which illustrates the spectral band of 
HCl. The point O corresponds to Voip, and the vertical lines to the 
right and to the left indicate the obtained frequencies. The height of 
a line is proportional to = 
the intensity at the given R 
frequency. When the res- 
olution is high, each line 
appears distinct. On the 
other hand, when the res- 
olution is low, the lines = 
merge into a band the in- R 


tensity dependence of 7=500%, = 
which is given by the 


T=100°K 


envelope of the spectral 0 
lines. In Fig. 230, we see  7=4000% R fa 
a diagram of the energy AmA 
transitions which produce 0 
this band. It should be Fi 
Fig. 229 


noted that a pure vibra- 
tional transition (from 
j =0to j= 0) is forbidden and as a result there is a gap in 


the middle of the band. There is an absorption maximum to the 
right as well as to the left of the vibra- 
tion frequency. For the reason discussed 
in the preceding article, the absorption 
3 maxima occur for the j-values which 
are most frequently encountered at 
the given temperature. Therefore, as 
j0 the temperature increases, the shape 
of the spectral band changes as shown 

in the diagram. 
g Vibrations of a Polyatomic Molecule. 
A polyatomic molecule may execute a 
3 large number of vibrational motions. 
This number is equal to the number 
2 of vibrational degrees of freedom of the 
„ molecule and may be calculated as fol- 
45678 lows. A molecule consisting of V atoms 
FSP has 3 degrees of freedom. Three of 
— ý wù P them are associated with the coordi- 
nates of the molecule’s centre óf mass. 
Fig. 230 In the general case, the number of rota- 
tional degrees of freedom is also equal to 
ecule- have only two rotational degrees of 
a line passing through the centres of the 


4 


three. But linear mol 
freedom since rotation about 


“s, 
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atoms is physically meaningless. Thús, the number of vibrational 
degrees of freedom and, hence, the number of vibration frequencies 
is equal to 3N — 6 or 3N — 5. If the dipole moment of the molecule 
does not change for a given vibration, the corresponding frequency 
will not be manifested. (We shall return to the problem of so- 
called inactive vibrations later.) Be that as it may, the number of 
vibration frequencies and, hence, the number of bands in the infra- 
red spectrum, is strictly determined by the number of atoms in the 
molecule and by its symmetry. 

In the absence of molecular rotation, i.e., in the case of solids, an 
infrared absorption spectrum consists of lines which correspond 
to vibrational transitions. Since, perforce, thick layers are used in 
such an investigation, there is considerable absorption under normal 
conditions and the lines merge into a band. In liquids, molecular 
rotation is retarded and the rotational structure of such a band will 
be smeared, i.e., individual lines can no longer be detected. 

Now, let us consider the physical meaning of vibrations in a poly- 
atomic molecule. Actually, what kind of vibrations occur? In the 
case of a diatomic molecule the situation was clear, i.e., we were 
dealing with vibrations along a bond line. What quantities vibrate 
harmonically in polyatomic molecules? 

For any molecular vibration, the deviations of atoms from their 
equilibrium positions may be described by displacements along 
a bond and by the distortion of bond angles. The instantaneous con- 
figuration of a vibrating molecule may be completely described by 
(BN — 6) q; coordinates (using the word coordinate in its broad 
sense). For an arbitrary choice of the q; coordinates, their values will 
not obey a simple vibration law. The law of change with respect to 
time of each q; can be represented by a complex, albeit periodic, 
curve. However, it turns out that it is possible to describe a vibrat- 
ing molecule by (3N — 6) Q; numbers which vary harmonically 
with frequencies v;. These Q; “coordinates” are called normal coor- 
dinates and the frequencies v; are called normal vibration fre- 
quencies. 

The fact that it is possible to introduce normal coordinates means 
that the periodic curves of change of any q; coordinates may be 


resolved into spectra of normal vibration frequencies. We can 
always assume that a vibration spect 


l rum consists of normal vibra- 
tion frequencies. ; 

What is the nature of Q; coordinates? Are they obtained only for 
a particular choice of coordinate system? The answer to the latter 
question is no. First and foremost, normal coordinates are linear 
combinations of q; displacements. Therefore, a normal coordinate 
describes the vibration of a molecule as a whole. Examples of nor- 
mal vibrations are illustrated in Figs. 234 and 232 for CO, and H,O 
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molecules. The actual vibration of a molecule is the resultant of the 
indicated motions. 

The normal vibration frequencies of a molecule can be determined 
from its spectrum. These can then be used to obtain a clear picture 


of the molecular vibrations. 0 
The characteristic nature of many 
vibration frequencies is of great prac- AASS oi 
tical importance. Careful study has H Oe oe 
a) > H 
0 c 0 
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shown that basically in certain normal vibrations only one in- 
aries. If a molecule preserves 


teratomic spacing or one bond angle v 
a group of related com- 


that bond, such a frequency varies little in 
pounds. This fact is utilised in chemistry. 

The vibration frequencies of a molecule are measured not only by 
means of infrared absorption spectra but by means of Raman spectra 
as well. As will be seen below, these two methods effectively sup- 


plement each other. 


202. Raman Scattering of Light 


ing refers to the particular case of the scattering of 
light of frequency V by a substance when, in addition to the strong 
scattering of light of constant frequency y, there appears a series 


of lines of lower and higher frequen Ae 
Usually, observations are mace at right angles to the incident 
light. A mercury lamp provides the required radiation. The spectrum 
of this radiation contains several intense lines, the most important 
cof which is a blue line corresponding to a wavelength of 4,358 A. 


By means of a spectrograph, one can obtain a raph 
scattered radiation spectrum. Such a photograph is shown in Fig. 233. 
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The main characteristic of such a spectrum is the following. About 
each excited line there appear identical groups of considerably 
weaker lines. These satellites are usually spaced symmetrically to 
the right and to the left, but they may differ in intensity. This phe- 
nomenon was discovered independently by Raman in India and 
Landsberg and Mandelshtam in the Soviet Union. Raman’s work, 
however, was published first*. Hence, such spectra are called Raman 
scattering spectra. 

The spectral pattern can be explained as follows. Basically, a pho- 
ton hv is scattered by a molecule elastically, i.e., the frequency 

remains constant. However, in addi- 

ssa tion to such scattering, it is also pos- 

AEH Sible to have scattering with some loss 

; of energy; such energy may be expend- 

ed in the transition of a molecule 

from one level to another. Let us as- 

— © sume that a photon hy has lost an 

amount of energy equal to that 

required to raise a molecule from the 

zero Vibrational level to the first level. 

The energy loss is z=: — Gr-0= hvyiy- Therefore, the scattered 

photon has an energy h (v — vj). An associated line or “satellite” 
appears in the spectrum on the side of lower frequencies. 

The lower frequencies, v — vp», ave called red satellites and the 
higher frequencies violet satellites. Scattering with a frequency 
greater than v occurs when a photon hits an excited molecule. In 
such a case, the photon is scattered, but it simultaneously gains the 
“extra” energy due to the transition of the molecule to a lower level. 
If the excited molecule was at the first vibrational level, the photon 
increases its energy by hv,;, and the frequency v + vsi» appears in 
the spectrum. 

This scattering mechanism excellently explains the difference 
between the intensities of the red and violet lines. At room tempera- 


Fig. 233 


ture, most molecules are at the zero level with an energy Win. 

A smaller number of molecules are at the first excited level with an 
3 Ae 

energy -y hvoip. Therefore, it is clear that the intensity of the violet. 


lines must be less. Moreover, at low temperatures the violet lines 
practically disappear. The ratio of the intensities of the violet lines 
to the red lines is proportional to the ratio of the number of atoms. 
in the first state to the number in the zero state. According to the 


* Raman sent a telegram about his discovery to the British journal Nature- 
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Boltzmann law, 


_ 3) vib a 
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This formula agrees excellently with experimental results. 

Thus, the displaced lines of a Raman spectrum are shifted by 
amounts equal (in energy units) to the difference between the energy 
levels of the given molecule. 

We have discussed the case of two vibrational levels, but it is clear 
that the discussion is valid for all energy transitions—pure rotation- 
al, vibration-rotational, etc. The satellite lines closest to the main 
scattering line correspond to the lowest energy transitions. The 
rotational spectrum is located considerably closer to the main line 
than the vibration-rotational spectrum. 

In the case of absorption spectra, the selection rules for the oscilla- 
tion quantum number are the same as those for Raman scattering, 
but the selection rules for the rotational quantum number are differ- 
ent. Transitions are allowed for which 


Av=+1 and Aj=0, +2. 


Thus, the vibration-rotational band consists of a pure vibrational 
line displaced from the excited line by voip and a series of lines dis- 
placed from the excited line by Voie — Wrot and Vvis ++ vrot = 

Raman spectra are usually obtained by scattering from liquids. 
The lines Vpip + 2Vrot appear smeared, but the lines of the pure 


vibration spectrum are distinct. 
Raman spectra have an important Sp 
tra. The measurements are transferred, so to speak, to the visible 


region. The frequencies which were measured directly in the infrared 
spectrum are determined as the difference between the main line and 
the Raman line with approximately the same accuracy. 

dispense with the infrared spectrum. 


It would seem that one could | [ 
However, this is not always the case. In certain respects, the infrared. 


and Raman spectra supplement each other. ast 
What is the difference between the process of wave radiation from 


a molecule and the process of scattering from a molecule? In both 
cases a molecule sends wavelets into space, 1.e., 10 both eee mol- 
ecule behaves during radiation like a dipole. However, in the first 
case a molecule behaves like a dipole in the absence of an eels 
field, while in the second it behaves like a dipole when acte upon 
by the field of an incident wave. Thus, radiation or absorption will 
occur when changes in the state of a molecule Caton: rotation, 
etc.) are accompanied by changes in the jnduced dipole moment, 


advantage over infrared spec- 
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i.e., in the polarisability B. Theoretical calculations indicate that 
such a change should occur when the configuration of the molecule 
passes through equilibrium. i . . 

Lines will occur in an infrared spectrum for vibrations which sat- 
‘isfy the condition 


dp 
(a) ar, 
o 
Raman lines will occur for vibrations which satisfy the condition 


aß 3 
(E) r=0 Æ 9: 

Quite often these conditions exclude each other. Therefore a certain 
vibration may be active in the infrared spectrum but inactive in 
the Raman: spectrum, and vice versa. 

A CO, molecule may serve as an example. One of such 
three vibrations—the linearly symmetrical vibration—leaves the 
dipole moment unaltered and equal to zero. That vibration is inac- 
tive in the infrared spectrum. In the Raman spectrum, on the other 
hand, only that vibration will be active; the other two will be absent. 
In the case of an anti-symmetrical vibration, 
follows: In both extreme positions, the deformation of the electron 
cloud, and hence the polarisability, is the same. During vibration 
the polarisability changes in the same manner in both half-periods, 


and at the equilibrium position passes through a minimum or max. 
imum, but this does not mean that 


(£) 0 0. 


We shall not discuss these regularities any further. They have 
been studied in detail and the results are available in tabular form. 
Such tables enable us to determine from the symmetry of a molecule 
the number of vibration frequencies in-its infrared and Raman spec- 
tra. The converse is also true, namely, the symmetry of a molecule 
can be determined from the number of lines in its spectra. 


a molecule’s 


one may reason as 


203. Electron Spectrum 


Let us consider absorption spectra in which the tr 
the visible and ultraviolet regions. The magnitude 
the incident light will be of the Same order of m 
difference between the electronic levels of a mo 
transitions become possible. However, as has been ir 
electron transitions are accompanied by changes in 
rotational energy. Therefore, 


ansitions are in 
of a photon of 
agnitude as the 
lecule. Electron 
ndicated already, 
vibrational and 
a very broad band is associated with 
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every such transition. Moreover, under normal experimental condi- 
tions this band is continuous, i.e., its “vibration-rotational” struc- 
ture is not discernible. Each electron transition band contains nu- 
merous narrow vibration-rotational transition bands, whereby al] 
changes of the oscillation quantum numbers are possible. ` 

The properties of a molecule in an excited electronic state differ 
from its properties when it is at a zero electronic level. When 


1 — 
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Fig. 234 


a molecule becomes excited, the system of vibrational and rotational 
levels and hence the vibration frequencies, i.e., the differences 
between the vibrational levels, change. Also, the shape of the poten- 
tial curve and the equilibrium spacings between atoms change. 
Absorption curves in the visible and ultraviolet regions are suf- 


ficiently characteristic to be used in identifying substances. 
$ f the absorption of light on the thickness of 


The dependence o y 
a layer of substance may be expressed as follows (cf. p. 120): 


I=", 


F b : inci I the intensity of the 

where Z, is the intensity of the incident beam, cree 

transmitted beam, « the thickness of the layer, and p the absorption 

coefficient for light. The value of the absorption coefficient depends on 
g 


I A 

inci bli rye of — as a functior 

the wavelength of the incident light. A curve n *8? à 

of wavelength is sometimes called an absorption curye. poney 

however, this term refers to a curve of p as a function of À or v. 
, 


34—1409 
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The relation for the absorption of light in solutions may be written 
in the form 


I= Ioe-tnt or I= I-et, 


where | is the distance traversed by the beam in the substance and 
kN and ec are expressions for the absorption coefficient of the solu- 
tion. It is quite reasonable to assume that the absorption coef- 
ficient is proportional to the concentration of the substance, which 
may be expressed as the number of molecules N per unit volume or 
the number of moles of substance c per litre of solution. In the case 
of solutions, the term “absorption curve” usually refers to a curve 
showing the dependence of the coefficient Æ or e on A. 

Examples of absorption curves in the visible and ultraviolet spec- 
trum are shown in Fig. 234. Curve Z is for Congo red, 2—for aniline, 
3—tor phenol and 4—for benzene. 


CHAPTER XXX 
ATOMIC NUCLEI 


204. Experimental Methods of Nuclear Physics 


Investigation of the structure of atomic nuclei is inseparably 
linked with the study of spontaneous and induced decay of atomic 
nuclei and nuclear particles. By studying the fragments of a disin- 
tegrating atomic nucleus and tracing the fate of these particles, we 
can make certain inferences about the structure of the nucleus and 
about the nuclear forces. 

It is not surprising that the spontaneous decay of nuclei, i.e., 
natural radioactivity, was the first to be studied in detail. Concurrent- 
ly, physicists began to study cosmic rays—radiation from outer 
space possessing extraordinary penetrating power. In interacting 
with matter, cosmic particles behave like projectiles. For a long 
time, cosmic ray investigations were the primary means of studying 
transformations of elementary particles and to a certain extent of 
studying atomic nuclei. At present, streams of particles created in 
accelerators are the primary means of studying the disintegration 
of atomic nuclei. 

The experimental methods to be discussed below are applicable 
to the study of cosmic rays and particles created during the nuclear 
bombardment of one or another target. 

Wilson Cloud Chamber. When a fast particle passes through 
a chamber containing supersaturated vapour and creates ions along 
its path, a track is left that is very similar to the “tail” occasionally 
seen in the sky after a plane has passed. This track is produced by 
condensation of the vapour. The ions along the path of the particle 
are centres of vapour condensation; as a result the track is easily 
detected. The track of the particle may be photographed or ob- 
ser ' ? 
Pee beh the vapour in the chamber may be controlled by 
varying the volume of the chamber. This is achieved by means of 
a piston. Rapid adiabatic expansion brings the vapour to a state of 


supersaturation. 4 
Tia Wilson cloud chamber is located in a magnetic field, the veloc- 
ily of a particle in the chamber may be determined from the curva- 
we conversely, e/m may be deter- 


ture of its path if e/m is known 07; 
mined if the velocity is known (see the formula on p. 444), 


34% 
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Jonisation Counters and Jonisation Chambers. An ionisation device 
used in radiation investigations usually consists of a cylindri- 
cal condenser filled with gas. The cylindrical casing constitutes one 
electrode and a thin insulated wire placed along the axis of the cylin- 
der the other (Fig. 235). The magnitude of the voltage applied to the 
condenser and the pressure of the gas in the counter must be chosen 
in a special manner, depending on the nature of the problem. In 
one such widely used device, known as a Geiger counter, a voltage 
equal to the breakdown voltage is applied 
between the cylinder and the wire. When 
an ionising particle enters such a counter 
through the wall or insulator, there passes 
through the condenser a pulse of cur- 
rent which continues to flow until the pri- 
mary electrons and the created electrons 
and ions of the self-maintained discharge 
reach the positive casing of the condenser. 
This pulse of current can be amplified by 
ordinary radio-engineering means and the 
passage of the particle through the coun- 
ter may be determined by a click or light 
flash, or by means of a digital counter. 

A digital counter can be used to deter- 
mine the number of particles entering the 
instrument. This requires that the pulse 

Fig. 235 of current due to one particle cease by 

the time the next particle enters the 

counter. If the operating conditions of the counter are not chosen 

properly, the counter begins to “choke” and count incorrectly. The 

resolving power of an ionisation counter has an upper limit. This 

limit, however, is quite high, i.e., up to 10,000 particles per second 
may be counted. 

If we decrease the voltage, the pulse of current passing through 
the condenser may be made proportional to the number of created 
ions (proportional counter). For this purpose, it is necessary to oper- 
ate in a region where the gas discharge is not self-maintained. Pri- 
mary electrons*moving in the electric field of the condenser 
late energy, whereupon ionisation by collision commences and new 
ions and electrons are produced. The first n pairs of ions produced 
by a particle entering the counter are transformed into $n pairs of 
ions. When the operating conditions are such that the discharge is 
not self-maintained, the amplification factor k is constant. Thus, 
a proportional counter not only establishes that a particle has passed 
through the counter, but also measures the ionising power of the 
particle. A 


accumu- 
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Just like in the Geiger counter described above, discharge in 
a proportional counter ceases when no more ionisation takes place. 
The distinguishing feature of a Geiger counter is that a particle 
entering such a counter behaves like a trigger mechanism, and the 
breakdown time is independent of the primary ionisation. 

Since a proportional counter is sensitive lo the ionising power of 
a particle, the operating conditions of the counter may be chosen 
so that only certain kinds of particles are recorded by the instrument. 

If the operating conditions of the instrument correspond to satura- 
tion current (achieved by decreasing the voltage), the current through 
the counter becomes a measure of the radiation energy absorbed 
in the instrument per unit time. In such a case, the device is walled 
an ionisation chamber. The amplification factor Æ is then equal to 
unity. The merit of an ionisation chamber is its high stability of 
operation. The design of an ionisation chamber varies considerably 
from case to case. The choice of chamber filling, wall material and 
number and shape of electrodes will depend on the purpose of the 
investigation. Chambers vary in size from about a cubic millimetre 
to several hundred cubic metres. Under the action of a constant 
source of ionisation, currents ranging from 107" to 1077 ampere are 
produced in ionisation chambers. 

Scintillation Counters. The method of counting elementary par- 
ticles by the flashes of a fluorescent substance (scintillations) was 
first used by Rutherford in his classical investigations of the struc- 
ture of atomic nuclei. Modern instruments bear little resemblance 
to the simple device used by Rutherford. ; F 

A particle impinging on a phosphor” may produce a flash of light. 
There exist a large number of organic and inorganic substances which 
are capable of transforming the energy of charged particles and 
photons into luminous energy. The duration of afterglow in many 
phosphors is very short—of the order of a thousand millionth of 
a second. This makes it possible to construct scintillation counters 
with large counting rates. The light yield ofa number of phosphors 
is proportional to the energy of the incident Palea: Bae phosphors 
find application in the Cone of counters for the determination 

j o articles. 
i To a phosphors are combined with photomultipli- 
ers having ordinary photocathodes that are sensitive to visible light. 
The electric current produced in a photomultiplier is amplified and 


fed to a counting device. 
The most widely used org 
bene and terphenyl. These ¢ 


anic phosphors include anthracene, stil- 
hemical compounds belong to the class 


- i vhich, generally speakin 
* Phosphors constitute a large group of solids w nE p 8 
do be her pe AniGe in common with the chemical element phosphorus. 


Whe 
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of so-called aromatic compounds, which contain rings of six carbon 
atoms. For use as scintillators, these substances must be obtained in 
the form of monocrystals. Since large crystals are rather difficult to 
grow and since crystals of organic compounds are very fragile, the 
use of plastic scintillators, i.e., solid solutions of organic phosphors 
in transparent plastics such as polystyrene and other such high- 
polymer materials, is of considerable interest. Of the inorganic phos- 
phors, halides of alkali metals, zinc sulphide and tungstates of alkali 
earth metals find application. 

Cherenkov Counters. As far back as 1934, Cherenkov showed that 
when a fast particle moves in a perfectly pure liquid or solid dielec- 
tric a peculiar luminescence occurs. This luminescence basically differs 
from fluorescence, which is related to energy transitions in atoms 
of the substance, and from bremsstrahlung of the continuous X-ray 
spectrum type. Cherenkov radiation occurs when a charged particle 
moves with a velocity exceeding the phase velocity of the propaga- 
tion of light in the dielectric. The distinguishing feature of this 
radiation is that it is propagated along a conical.surface in the prop- 


agation direction of the particle. The angle of the cone may be deter- 
mined from the formula t 


v 
cos 0 = yo 


where 0 is the angle between the surface of the cone and the direc- 
tion of motion of the particle, V is the velocity of the particle, and 
v is the velocity of light in the medium. Thus, for a medium having 
a given index of refraction z, there exists a critical velocity, V = 


c . . A . rer . 
v= i below which no radiation occurs. At this critical velocity, 


the radiation is parallel to the direction of motion of the particle. 
For a particle moving with a velocity very close to the velocity of 


light (V =c), the angle of radiation 0 = arccos = is a maximum. 


In the case of cyclic hexane, n = 1.437 and 0 = 46°. 
Theoretical calculations and experiments show that, in the 
main, the Cherenkov radiation spectrum is located in the visible 
region. 
Cherenkov radiation is a phenomenon similar to the formation of 
a nose wave near a moving ship; in such a case, the velocity of the 


ship is greater than the velocity of the waves on the surface of the 
water. 


Fig. 236 illustrates how Cherenkov radiation is formed. A charged 
particle moves along the axial line and the electromagnetic wave 
following the particle temporarily polarises the medium at points 
of the particle path. All such points become sources of spherical 
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waves. There exists a certain angle for which these spherical waves 
coincide in phase and form a single front. 

Let us consider two points along the path of the charged particle 
(Fig. 237). Two spherical waves have been created—one at the in- 
stant of time ¢ and the other at the instant of time t + T. Clearly, 
q is the time required for the particle to traverse the distance between 
these two points. In order for these two waves to be propagated at 
an angle @ in the same phase, 
the time of travel of the first 
beam must be greater than the 
time of travel of the second 
by t. During the time T, the 
particle traverses a distance 
Vr and the wave a distance vt. 
Thus, we obtain the formula 

A 


cos 0.=—--: 
os 7 
Cherenkov radiation is wide- 
ly used as a means of regis- Fig. 236 


tering elementary particles. 
Counters based on this phenom- 
enon are known as Cherenkov 
counters. Like scintillation 
counters, they contain a lumi- 
nescent material, photomulti- 
pliers and amplifiers of pho- 
toelectric current. Various 
types of Cherenkoy counters 
have been designed. 

Such counters have numerous merits. These include a high count- 
ing rate and the ability to determine the charge of particles moving 
with velocities very close to that of light (the curve of light yield 
as a function of particle charge rises sharply). Only by means of 
Cherenkov counters is it possible to solve such important problems 
as the direct determination of the velocity of a charged particle, the 
determination of the direction in which an extremely fast particle 


Fig. 237 


moves, etc. 

Arrangement of Counters. To study transformation and interac- 
tion processes of elementary particles, one must be able to detect the 
emergence of a particle at a particular point and trace its subsequent 
course. Various arrangements of standard counting circuits are used 
in solving such problems. For example, two or more counters may be 
electrically connected in such a manner that a count occurs only 


when a discharge begins in all the counters at exactly the same time. 
This may serve to indicate that a certain particle has passed through 
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all the counters. Such an arrangement of counters is known as 
a “coincidence” arrangement. 

Nuclear-Emulsion Method. A gelatinous film containing micro- 
crystals of silver bromide may serve as the photosensitive layer of 
a photoplate. Basically, the photographic process consists in the ion- 
isation of these crystals, resulting in the reduction of the silver bro- 
mide. This process occurs under the action of light or of charged par- 
ticles. A concealed track is formed in the emulsion when a charged 
particle passes through it. This track may be seen after the photo- 
plate is developed. Photoemulsion tracks tell us a great deal about 
the particles producing them. Thus, strongly ionising particles leave 
heavy tracks. Moreover, since the ionisation produced depends on 
the charge and the velocity of the particle, considerable information 
may be obtained by simply examining the appearance of the track. 
The range of a particle in the photoemulsion is also a source of valu- 
able information. By measuring the length of the track, one may 
determine the energy of the particle. 

Ordinary photoplates, having- thin layers of emulsion, are hardly 
suitable for nuclear investigations. Such plates would register only 
those particles the motion of which is strictly in the plane of the 
plate. Misovsky and Zhdanoy in the Soviet Union, and several years 
later Powell in Britain, introduced the use of photoplates having an 
emulsion thickness of almost 1 mm (one hundred times greater than 
the thickness in ordinary plates). The com plex transformations 
occurring when a particle disintegrates are registered in clear visual 
form in the photographic method. 

Fig. 238 shows a typical photograph obtained by this method. 
Nuclear transformations have occurred at P and S. 

In a recent modification of this method, an emulsion chamber of 
considerable volume is used as the medium for registering the parti- 
cle track. 

Methods of Analysing Observations. Using the described instru- 
ments, an investigator is able to determine the most important con- 
stants of an elementary particle, viz., velocity, energy, electric 
charge and mass. All these parameters may be determined very accu- 
rately. Moreover, when a stream of particles is available, it is possi- 
ble to determine the spin and magnetic moment of an elementary par- 
ticle. For this purpose, a magnetic field is used to divide the beam 
(see p. 495). 

It should be recalled that only charged particles can be observed 
directly. Our knowledge of neutral particles and photons is obtained 
indirectly, i.e., by determining how these invisible particles 
affect charged particles. Nevertheless, our knowledge of invisible 
particles is highly reliable. 
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The laws of conservation of momentum and energy have wide 
application in the investigation of elementary particle transforma- 
tions. Since we are dealing with fast particles, changes in mass must 
pe taken into account in applying the law of conservation of energy. 

Consider a particle track in which “branching” occurs. This signi- 
fies that the first particle has been transformed into two particles. 
Therefore, the following relations must hold. First, the momentum of 
the first particle must be equal to the vector sum of the momenta 
of the created particles: 


Pı = P2 + Ps- 
Secondly, 


Kı=K: +K; +46, 


where AG = Am (the increment Am is the difference in mass be- 
tween ms + m and m). 

Nuclear physics experiments show that the laws of conservation 
are strictly obeyed in all elementary particle transformations. Hence, 
these laws may be used to explain the properties of neutral parti- 
cles, which do not leave tracks in a photographic emulsion and do 
not ionise a gas. When an investigator observes two diverging tracks 
on a photoplate, he knows that at the branching point a neutral 
particle transformation has occurred. By determining the momen- 
tum, energy and mass of the created particles, he may reliably 
ascertain the values of the parameters of the neutral particle. This 
is how the neutron was discovered and how neutrinos and neutral 
mesons are studied (see below). 


205. Nuclear Particles 


Atomic nuclei consist of protons and neutrons. The basic character- 
istics of a proton, like any other elementary particle, are its charge, 
mass, spin and magnetic moment. A proton has a positive elemen- 
tary electric charge, i.e., its charge is equal in magnitude, but 
opposite in sign, to the charge of an electron. Its mass is equal to 
1.6724 x 10-** gm, which is 1,836 times greater than the mass of 


7 7 ; 1 F A 
an electron. It has a spin of z and a magnetic moment of 1.41 X 
x 107%? Gaussian unit. 
A neutron has a somewhat greater mass than a proton, i.e., its 
mass is equal to 1.6737 x 10- gm. It also has a spin of = . Its mag- 


netic moment is anti-parallel to the spin and equal to 0.966 x 1072 
Gaussian unit. 


A neutron does not have an electric charge and does not leave 
a track in a Wilson cloud chamber or on a photoplate. The properties 
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of a neutron are primarily determined by studying collisions between 
neutrons and various nuclei. Knowing the mass and velocity of 
a nucleus which collides with a neutron, one can determine the 
velocity Vnew and the mass Mneu Thus, in accordance with the laws of 
elastic impact (see p. 67), we obtain 


veer ep Mneut__ 
nee Mneut +Mnuet 


where Mneu and Vneut are unknown quantities. By studying colli- 
sions between neutrons and various nuclei, one can roughly determine 
M wens 12 LENIS assumed that the initial velocity Uneu is the same in 
different collisions. A precise value for M neut May be determined from 
the values of the mass defect of nuclear reactions (see below). 

The spin and magnetic moment of neutrons have been directly 
determined from very interesting measurements on a stream of neu- 
trons passing through magnetised iron. However, we shall not de- 


scribe these measurements. 


Uneuts 
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The number of protons Z in the nucleus of a given element deter- 
mines the chemical properties of the element, and the position of 
an atom in the Mendeleyev periodic table is determined precisely 


by the number of its protons 2. r i 
A chemical element may have several isotopes, which differ from 
one another with respect to the number of neutrons m the nucleus. 
An isotope of a ‘given element may be described by its mass num- 
ber M, the total number of protons and neutrons in its nucleus*. 
Thus, the number of neutrons in a nucleus is equal to M = Z. 
Ordinary natural substances are a mixture of isotopes. The isotop- 
ic composition of a natural substance generally remains fixed and 
is, therefore, characteristic of the given chemical Seta, 
ly, one of the isotopes of the mixture predominates. or Paman 
hydrogen is encountered in nature in the form of anay hydrogen 
H? and deuterium H =D, whereby the percentage of tie ortir 
is 99.98% and that of the latter 0.02%. The per nE o in 
natural oxygen is 99.76%. In natural uranium the main isotope 
is U2; its percentage ÍS 99.28%. 
al element is denoted by the 
number is indicated by a superscript to the 
right. A subscript to the left is frequently used to indicate the atomic number 


Z is is necessary since the chemical symbol deter- 
Z of the element, but this iS aa, of oxygen may be denoted by the symbols 07°, 


ines Z. T see isotopes | ; ) 

PAON the i and 3018. The nuclei of these isotopes contain 8, 9 
a s016, . The 
and 10 neutrons (M — 2) respectively- 


The nucleus of an isotope of a given chemic 


symbol of the element. The mas 
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Let us represent the mass of the lightest and most prevalent isotope 
of oxygen by Mo. The magnitude Mo is called an atomic weight 
unit. It is customary to express the atomic weight A of isotopes and 
elements in such relative units. Precise measurements indicate that 
an atomic weight unit is equal to a mass of 1.662 x 40-24 gm. The 
absolute value of the mass of an isotope may be determined from 
the formula. 


M4 = 1.662 x 10-24 A om. 


The mass of a proton is 1,836 times greater than the mass of an 
electron. Therefore, the mass of an atom and the mass of its nucleus 
are almost equal. However, in a number of cases, particularly for 
light atoms, this difference may be determined by means of modern 
measuring methods and should be taken into account. It is evident 
that the following relationship exists between the mass of an atom 
M a and the mass of its nucleus My. 


My=M,—Znm. 
In atomic weight units, m = 5.5 X 10-4* 
tween M y and M , is of the order of sev 
of these masses (for heavy 
sandths of a per cent). 

The relative atomic weight of an isotope is approximately equal 
to its mass number. For example, the mass of H! is equal to 1.00812, 
of D? to 2.01472, of Ne? to 19.9981, ete. 

The following important conclusion may be drawn from a careful 
study of a table of isotope masses: the mass of a nucleus is less than 
the sum of the masses of its constituent particles. For example, the 
mass of a neutron is 1.00893 and the mass of a proton is 1.00812; 
hence the sum of the masses of two neutrons and two protons is equal 


to 4.0341; but the mass of a helium atom, consisting of two neutrons 
and two protons, equals 4.0039. Thus, the mass of a helium nucleus 
is 0.0302 of an atomic weight unit less than the sum of the masses 
of the constituent particles of the nucleus. This value is a thousand 
times larger than the accuracy of the measurements 

This difference between the sum of the masses of the constituent 
particles of the nucleus and the mass of the nucleus is an important 
example of mass defect. Every nucleus has a specific mass defect. 

One of the most important conclusions of the theory of relativity 
is the principle of the equivalence of mass and energy (see p. 424). 
This principle states that if a system acquires or loses a quantity of 
energy A@, the mass of the system increases or decreases, respec- 


- Thus, the difference be- 
eral hundredths of a per cent 
atoms—of the order of several thou- 


* is * . CO E: T 
1.062% ton obtained by dividing the mass of an electron in grams by 
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tively, by Am = A@/c?. The mass defect of a nucleus is easily 
explained by means of this principle, i.e., it is a measure of the bind- 
ing energy of the nuclear particles. 

Let us see what this means. In chemistry and physics, binding ener- 
gy is understood to mean the energy required to completely break 
a bond. If a nucleus could be divided into its constituent parti- 
cles, the mass of the system would increase by the value of the mass 
defect Am. From the viewpoint of Einstein's law, this means that 
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Fig. 239 


the nucleus has been given an energy AG = c*Am, which is equal 
to the binding energy. Hence, a change in mass of one atomic weight 
unit is equivalent to a change in energy of 


1.662 x 10-24 x 9 x 102 erg= 1.496 x 10 erg = 931.8 Mev 
(1 ev = 1.601 x 107? erg and 1 Mev = 10° ergs). Using these val- 


ues and knowing the values of the mass defect, one can easily calculate 
the binding energy of an atomic nucleus. i ’ 
Fig. 239 shows a curve of binding energy per nuclear particle, i.e., 
c?A m/M, as a function of mass number. It is seen that the binding 
icle first rises rapidly, although not quite 


energy per nuclear parti 
uniformly, then remains at approximately 8 Mev, and finally drops 


slightly for the last elements in the Mendeleyev periodic table. The 
following conclusion may be drawn from the fact that the energy 
remains constant at 8 Mev over a large portion of the curve: Since 
the binding energy per particle does not depend on the total number 
of particles in a nucleus, interaction in a nucleus must occur only 
between particles that are very close to each other. It follows that 
nuclear forces between particles become effective only when one 
particle approaches another very closely (see below). 

It is instructive to compare the 8 Mev value with the chemical 
binding energies of molecules. The latter are usually equal to sever- 
al electron volts per atom. Thus, the energy required to break up 
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a nucleus is several million times greater than the energy required 

< molecule into atoms. À 
D eee will be discussed in greater detail below. It is 
already clear from the above examples, however, that these forces 
reach tremendous values when a nucleus breaks up. It is also clear 
that nuclear forces constitute a new kind of force, since they can 
bind together particles the electric charges of which are of the same 
sign. Nuclear forces are not reducible to electrical forces. 
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Substances in which atomic nuclei are continuously disintegrating 
are said to be radioactive. Two kinds of radioactive disintegration 
exist: a-decay and B-decay. 

Alpha decay consists in the ejection of an a-particle from an atom- 
ic nucleus. Since an a-particle consists of two protons and two neu- 


trons, its symbol is „Het. Thus, a-decay may be represented as 
follows: 


EM =, BI‘ + Hes, 


where El is an arbitrary chemical element. This disintegration reac- 
tion, as well as other nuclear reactions to be considered on p. 567, 
obeys the law of conservation of charge (the sum of the scripts in 
the right member of the equation equals Z) and the 1 | conserva- 
tion of mass number (the sum of the superscripts in the right mem- 
ber of the equation equals M). 

Beta decay consists in the ejection of an ordinary electron (RIOR 
a positron (B+) from a nucleus. It should be recalled that the masses 
as well as the magnitudes of the charges of these two particles are 
equal. The ejection of such a light electrically charged particle 
from a nucleus results in the nuclear transformation of a proton into 
a neutron or a neutron into a proton. Such a transformation ensures 
the conservation of electric charge upon disintegration. 

Beta decay may be represented as follows: 


EI" =p El" ote pit; == Se ff + Bat 5 


Thus, for B--decay, a neutron of the nucleus is transformed into a pro- 
ton (the number of protons increases); for B*-decay, the reverse 
transformation takes place. 

In addition, y-rays, i.e., electromagnetic radiation of shorter 
wavelength than X-rays, may be emitted in both types of radio- 
active disintegration. Alpha radioactivity occurs only for the heavy 
elements (beginning with bismuth); B--radioactivity is encoun- 
tered considerably more often than B*-radioactivity. 


Fat 
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A radioactive substance found in nature is said to be naturally . 
radioactive; a radioactive substance obtained by means of nuclear 
reactions is said to be artificially radioactive. 

If another element is formed when the nucleus of a radioactive 
clement decays, and if a third element is formed from the second, 
ete., the sequence of such elements is called a radioactive series. Four 
radioactive series, beginning with US, Th}, U®5 and U?8*, are 
known. 

Radioactive decay obeys the law 

N= Np, 


where Vy is the number of nuclei present at the initial instant t = 0, 
N the number of nuclei remaining (not disintegrated) after the elapse 
of a time t, and A a radioactive-decay constant, which is a constant 
for a given element. 

It is easily seen that the time interval 7 during which half of 
the number of atoms present decay is given by 


2 0.693 
f=h—== 5 


The time interval 7 is known as the half-life period or simply 
half-life of thè radioactive element. 

The predecessors of the naturally radioactive series have a half- 
life lying in the range of 105-410 years. On the other hand, the half- 
life of intermediate decay products and artificially radioactive ele- 
ments may be equal to an extremely small fraction of a second. 

A quantity of radioactive matter could be expressed, of course, in 
grams. However, it is easier and more convenient to describe a quan- 
tity of radioactive matter by its activity, i.e., the number of decay 
events per second. The curie is a historical unit of measurement 
which is equal to 3.7 X 410! decays per second. In laboratory work, 
this unit is found to be inconveniently large; hence, the millicurie, 
one-thousandth of a curie, is often used. Another unit used is the 
rutherford, which is equal to 10° decays per second. Thus, 4 milli- 
curie = 37 rutherfords. 

If the half-life 7 of a substance is known, its initial radioactivity 
can be easily determined. The fraction of substance decaying in 


1 second is equal to 


N h 
a Y E 
1 M il 
or, since A is a small number we obtain, 
SN. 0.693 
1 EM =F 


* U233 has predecessors among the transuranic elements. 
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In the case of m grams of substance of atomic weight A, the activity 
is equal to 
0.693 m decays 
T ^AX1.66x10721 sec 


It is generally assumed that the radioactive products found during 100 days 
of operation of a nuclear reactor (see below) amount to 1 curie per watt. For 
500,000 kW, the quantity of decay products is equal to about 500 gm, i.e., 
10-76 gm per watt. Taking the average atomic weight of the fission products as 
100, we find by means of the above formula that the average half-life of the 
radioactive products is equal to 105 seconds, i.c., about 24 hours. 


We cannot describe fully the properties of a radioactive substance 
by means of its half-life and activity alone. We must indicate, in 
addition, whether the substance is an 
a-particle or f-particle radiator and 
U 3 : bA ‘ 
whether the decay is accompanied by 
y-radiation. An even fuller descrip- 
tion requires that we specify the energy 
: of the particles ejected from the nuclei 
and the energy of the radiation. The 
properties of œ-particles radiated from 
one radioactive material differ very 
T little from the properties of «-particles 
radiated from another radioactive 
material. The initial velocities of such 
a-particles lie in the range of 15,000- 
20,000 km/sec and the number of pairs 
of ions formed in air by such an g- 
Fig. 240 particle lies in the range of 1x 10°-2 x 
X 10°. The energies of -particles eject- 
ed during decay are distributed con- 
tinuously from zero to several hundred or thousand kev. The ener- 
gies of y-rays emitted by one radioactive substance differ from the 
energies of y-rays emitted by another radioactive substance, but 
their order of magnitude remains the same for all elements. 
Now, let us direct our attention to the theory of œ- and fi-decay. 
In alpha decay, an g-particle tunnels through a potential barrier 
and is then subjected to electrostatic repulsion. The potential curve 
of a nucleus is illustrated in Fig. 240. We see a potential well and 
a potential barrier (beyond the barrier the electrostatic potential 
energy drops hyperbolically). It has been shown in the case of a radio- 
active element by scattering «-particles from nuclei of this ele- 
ment that the height of the potential barrier is at least 9 Mey. Never- 


theless, particles having an energy of only 4 Mey escape from nuclei 
by tunneling through the barrier. 
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Such a model explains why one radioactive element has a very 
short half-life and another has a very long half-life. If the difference 
between the energy of an @-particle in a nucleus and the height of 
the potential barrier changes even slightly, the probability of 
a-particle leakage through the barrier changes radically (see the 
formula on p. 485). 

Now, let us consider B-decay. The existence of the neutrino, a par- 
ticle which eluded detection for a long time, was first hypothesised 
in explaining this process. 

Two arguments immediately present themselves in support of the 
existence of the neutrino. First, such a particle is required to satisfy 
the law of conservation of angular momentum and, secondly, it is 
required to satisfy the law of conservation of energy. We know that 
neutrons, protons, electrons and positrons have a spin of 4/5. As has 
been indicated, B-decay in an atomic nucleus transforms a proton 
into a neutron, or vice versa. Since the number of nucleons in a nu- 
cleus remains unchanged, -decay cannot transform an even spin into 
an odd one. But this is precisely what would be required if only an` 
electron, having a spin of */., were ejected during B-decay. This con- 
tradiction was resolved by hypothesising the existence of the neutri- 
no, a particle having a spin of 1/5. In addition, by means of the neu- 
trino, we can explain why the f-particle spectrum is continuous. 

If B-decay consisted in the ejection of only an electron, this elec- 
tron would have a well-defined energy since the initial and final 
energy levels, i.e., the energies of the primary nucleus and the new 
nucleus, are well defined. But as has been indicated, a continuous 
spectrum of electrons, from a maximum velocity down to zero, is 
obtained. Such a spectrum can be explained by assuming that dur- 
ing disintegration two particles are ejected from a nucleus in accord- 


= 
ance with the equation n => p + e + v. The energy is divided be- 
tween the electron (or positron) and the neutrino in a random manner. 

The detection of a neutral particle having a negligibly small mass 
(we now know that the mass of a neutrino is less than 0.002 of the 
mass of an electron) is an extremely difficult problem. This problem 
was not solved until 1956. 

We have indicated that a neutrino is formed when a neutron disin- 
tegrates. A particularly large number of such decay events should 
occur in a nuclear reactor, where a tremendous number of nuclear 
fragments, rich in neutrons, are continuously formed. If neutrinos 
exist, a stream of such particles should emerge from a reactor. 
When a neutrino collides with a proton the following reaction should 
+> et +n, i.e., a positron and a neutron will be 
actions should be observed in targets containing 
gen atoms (i.e., protons) if they are placed 
tor. This reaction should occur very rarely 


- Occur: v +p 
formed. Such re 
large numbers of hydro 
close to a nuclear reac 
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(several times per hour) since a neutrino has extremely high penetrat- 
ing power. At the same time, a large number of other nuclear react- 
ions occur close to a reactor. 

The difficulties involved in detecting a neutrino are evident. They 
may be overcome by properly utilising the distinguishing features 
of this reaction. We know that the positron is quickly annihilated 
with an electron of one of the target atoms and that such annihila- 
tion yields two photons. The neutron, after covering a certain distance 
in the target, is absorbed by one of the impurity atoms (cadmium) 
added to the target for this purpose. The average life-time of a neu- 
tron before being absorbed has been calculated. It is equal to ap- 
proximately 5 u sec. The absorption of a neutron by cadmium is accom- 
panied by- y-radiation. Using modern measuring techniques, the 
experimenter must distinguish the following sequence of events 
from all others: the creation of two photons, followed in 5 p sec by 
a stronger pulse of y-radiation. Since this has been achieved, the 
existence of the neutrino may be considered to be a fact. 

Alpha and beta decay obey the following formula of decay as 
a function of time: M = Noe. Thus, the decay of a nucleus is an 
independent event that does not affect the behaviour of other nuclei. 
All nuclei have the same decay probability. Assume that half of 
the nuclei disintegrate during an interval of time T. Since the remain- 
ing half is subject to the same conditions as the original group of 
atoms, half of the remaining half disintegrate during an equal inter- 
val of time. The fact that the decay of a nucleus is not dependent on 
the behaviour of its neighbours means that in a given time interval 
At the fraction of the number of atoms present which disintegrate, 
136%, P will always be the same. This statement may be 
expressed as follows: 


AN 
eo —M. 
By integrating this expression, we obtain the exponential law of 


decay. 
It is useful to remember that the reason why exponential laws are 


encountered so frequently in physics is that they are the mathemati- 
cal expression for decrease in accordance with the widespread rule 
that for equal changes in argument a function decreases by the same 
fraction of its magnitude. 


208. Spin and Magnetic Moment of a Nucleus 


Nucleons, the components of a nucleus, have a spin and hence 
a magnetic moment. Thus, the presence of spin is not peculiar to an 
electron alone. Elementary particles may have spin, and a visua 
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interpretation of this fact is not only unnecessary, but incorrect. 
We have indicated already that the model of a particle rotating 
about its axis is completely without basis since the spin of a parti- 
cle cannot be interpreted classically. 

The angular momentum of any particle and, hence of an atomic 
nucleus, is given by : 


=a Sa ET h 
Vs(s+1)x =a ’ 


and the projection of the spin on the selected direction line may 
assume 2s -+ 1 values in the interval s to — s. Usually, it is not the 
above expression which is called spin, but the number s determining 
this expression. 

In accordance with the laws of quantum mechanics, the difference 
2s between the largest and smallest values of spin must equal a whole 
number or zero. Therefore, the spin of a particle may equal 
1 { 3 
Se A 
Neutrons and protons, just like electrons, have a spin of +. 


Examining tables of spin values for various atomic nuclei, we 
observe a number of interesting regularities. Thus, nuclei consisting 
of an even number of protons and an even number of neutrons (He, 
c12, O16) have zero spin. Evidently, the number of nucleons equal 
to a multiple of four generally plays an important role. In many 
cases, but by no means in all, the spin of an atomic nucleus may be 
determined in the following manner: the number closest to M which 
yields a whole number when divided by four is subtracted from M 


and the result is multiplied by Re . For example: Li® has a spin equal 


0, 


to 24 = 1, L? —Ž, BY — 1 and BY — $. 
There are no exceptions to the following, rather obvious, rule: 
the spin of a nucleus for which M is even is equal to a whole number 


or zero, and the spin of a nucleus for which M is odd is equal to an 


odd multiple of 1/2. J ' } l 
ic nucleus may be determined from the hyper- 


_ The spin of an atomi 
fine structure of its optical spectrum. Even though the energy levels 
{ferences in levels may be meas- 


split to a very small extent, the di 
ured very accurately. Splitting occurs as a result of the fact that 
different’ mutual orientations of electron spin and nuclear spin 
Correspond to different energies. s : 
Available data on nuclear spins indicate that the Pauli exclusion 
Principle is applicable to the protons and neutrons of nuclei. Two 
identical particles can be located at a single energy level only if 
their spins are anti-parallel. Since a proton differs from a neutrom 
35* 
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two protons and two neutrons may be located at a single level. 
This compact group of four particles, having a total spin of zero, 
will be recognised as the nucleus of a helium atom (a-particle). 

If a particle has spin, it also has a magnetic moment. The 
angular momentum JL is directly proportional to the magnetic 
moment M, but the magnetic moment may be either parallel or anti- 
parallel to the spin. f 

If the spin of a particle (simple or complex) is given by s, the 
particle’s magnetic moment may be written in the form 


M= gus, 


Ca ee l I : 
where u is an elementary magneton equal to ca , m is the mass of 
AT 


the particle, and g is a dimensionless factor. This equation is the 
general form of the relation given on p. 498 for the case of an elec- 


tron. For such a particle, s =i and g must be taken equal to 2 in 
order to obtain agreement with available experimental data. 

Different particles (elementary as well as complex) have different 
values of g. For example, the g-factor of a neutron is equal to 3.8206 
and that of a proton is equal to 5.5791. 


The value of an elementary magneton depends on the mass of the 
particle. However, it is customary to use only two magneton values: 
the Bohr magneton for light particles and the nuclear magneton (the 
magneton calculated for the case of a proton) for heavy particles, 


1 m : 
whereby HN = [gag HBonr- The values of the g-factor given above 
are calculated for py. 


A theory of g-factors and magnetic moments which relates these 
properties of a nucleus to its structure does not exist. 


209. Magnetic Resonance 


Assume that a substance containing particles of spin s and magnet- 
ic moment M is placed in a constant magnetic field of intensity H? 
The potential energy of such a particle is equal to the scalar product 
MN = M,H. According to the general law of quantum mechanics, 
this energy may assume only a discrete set of values corresponding 
to 2s + 1 possible orientations of spin and magnetic moment. 

As usual, the resulting system of energy levels is determined by 
the energy transitions. The selection rules allow transitions between 
neighbouring levels only if the difference between their s values is 
equal to one. Assume, for example, that in one state 


M= gus 
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and in another 
M,=gp (s—1). 
Hence, the difference in energy is 
y 2—6 =gpH. 
The energy levels will be equally spaced. 


The frequency of a radiated or absorbed quantum of energy corre- 
sponding to the calculated difference in levels is 


— SH 
Ve H. 


For an electron 
v= 2.8 x 10°H 

and for a proton 
vy = 3.46 x 10°H. 

It is seen that for each value of H there is a corresponding charac- 
teristic frequency known as the magnetic resonance frequency. For 
the realisable range of field intensities, these frequencies lie in the 
radio band: in the case of nuclei, in the short and ultrashort wave- 
length region; in the case of electrons, in the centimetre wavelength 
region. i 

Experiments and theoretical calculations show that in practice 
it is not possible to detect the radiation corresponding to these 
frequencies. However, it is possible to observe resonance absorp- 
tion of electromagnetic waves of corresponding wavelength. For 
this purpose, the substance is placed in a coil connected to a high- 
frequency generator and the coil is then placed in a constant magnet- 
ic field. The resonance may be “trapped” by varying the field inten- 
sity while the frequency is kept constant or by varying the frequency 
while H is kept constant. Magnetic resonance is extremely selec- 
tive. The width of the absorption peak is of the order of 0.4 mc/s 
at 460 me/s. 


The magnetic resonance method is widely used in studying vari- 


ous substances. Observation of electronic as well as nuclear resonance 


is of great interest. The presence of electrons having spins which 
are uncompensated for indicates to the chemist that so-called free 
radicals are present and enables him to determine the nature of 
chemical bonds. The chemical composition of a substance may be 
determined by means of nuclear resonance. But the following impor- 
tant fact should be noted. The magnetic resonance effect is so sensi- 
tive as to enable the detection of the superposition of an atom’s 
electronic cloud field on the external field. It turns out that the na- 
ture of this supplementary field depends on the properties of the chemi- 
cal bond between a given atom and the others. Thus, the resonance 
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frequencies of a given atom vary somewhat depending on the chemi- 
cal bond. This phenomenon is known as chemical displacement. 

In Fig. 241, we see an oscillogram of the absorption spectrum of 
a chemical compound. This illustrates magnetic resonance for fluo- 
rine nuclei. We see four peaks, one of which is three times as high as 
the other three. In the molecule whose structural formula is shown 
in the figure, four “different” fluorines are to be seen. There are three 
_ times as many fluorine atoms in the CF, group as each of the other 
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Fig. 241 


three “chemically different” fluorine atoms in the molecule. Chemi- 
cal displacement has separated the nuclear resonances of the fluo- 
rine atoms and made it possible to determine the structural formula 
of this compound. 

Thus, nuclear resonance provides us with anew means of chemical 
analysis. Instead of obtaining only a gross chemical formula (in our 
example, the total number of fluorine atoms relative to, say, hydro- 
gen atoms), we can now obtain a detailed picture of a chemical for- 


a i.e., the proportions of differently bound atoms of a single 
kind. 


210. Quadrupole Resonance 


The scheme of molecular energy levels discussed in the preceding 
chapter lacks certain detail. It turns out that each rotational level 
has an infrastructure. Between the electronic cloud of a molecule 
and the atomic nuclei there may exist yet another interaction: an 
atomic nucleus may possess an electric quadrupole moment and 


depending on the ori i i : 
entat ‘| 2 the 
electronic cloud ion of the atomic nucleus relative to th 


the molecule may S i 4 ri 
$ possess different values of er ergy- 
The magnitudes of the energy associated with this interaction are 
quite small and the correspondin Y “ Ar’ we) aped arou id 
: . : g energy levels are grou 4 I 
Thus, to describe a molecule one i i q ] 
5 , must indicate its quac rupo 
e energy level in addition to its e i vi i 5 
z lectro VIDT: x 
f nic, vibrational and rota 
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Quadrupole interaction does not always exist. If it does, the rota- 
tional transitions discussed above are in fact rotational-quadrupole 
transitions. Pure quadrupole transitions, i.e., transitions between 
separate quadrupole levels, may be observed. Moreover, rotational- 
quadrupole transitions may be resolved. Both problems can be 
solved by radio-spectroscopic methods. Pure quadrupole transitions 
lie in the range of 41-800 mc/s, i.e., in the short-wavelength band. 
Rotational-quadrupole transitions may be detected by studying the 
absorption of microwaves (millimetre waves) in gases. 

Pure quadrupole transitions are of primary interest. They can 
be observed in solids and in certain liquids. 

The following formula gives the energy of interaction between an 
atomic nucleus and the electronic cloud of a molecule for the case of 
an axially symmetrical field (such a field exists in all linear mole- 
cules): S 

3m2—s (s+ 1) . 
6 =€Q9FeQs—1)_ ; Ti 
here, Q is the quadrupole moment of the nucleus and q = 
is the second derivative of the electric potential along the axis of 
symmetry of the field. This effect does not occur in the case of nuclei 
having a spin of 0 or2/.. Also, no interaction occurs when the elec- 
‘tronic cloud surrounding a nucleus is spherically symmetrical. 

There are a limited number of levels. If the selection rules are 
taken into account, one finds that the number of possible transitions 


Thus, one line occurs when s= 1 ands=-, two 


is quite small. 


A quadrupole absorption spectrum can be obtained using a genera- 
tor the frequency of which is continuously varied in the wavelength 
interval under investigation. The resolving power of radiospectro- 
scopic methods is very high. In the case of a spectral line the fre- 
quency of which is of the order of 30 mc/s, the line width is equal to 
several hundred cycles per second. 

The electric quadrupole moment Q is a constant of an atomic nucle- 
us. It describes the deviation from spherical symmetry of the dis- 
tribution of electric charge in a nucleus. The quantity Q (in apare 


A 5 s 7 
lines when s= 5: and three lines whens = = 


centimetres) can be determined from the above formula if 4 = -zz 
e frequencies can be measured. The 


deviation from spherical symmetry of the distribution of electric 
charge in a nucleus can be determined to a first approximation by 
representing the nucleus as an ellipsoid of revolution. If the nucleus 
is elongated in the spin direction, then Q > 0; if it is elongated per- 


pendicular to the spin, then O20; 


is known and the quadrupol 
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An ellipsoidal nucleus tends to become oriented in a very definite 
manner in the field of an electronic cloud. The main energy level 
corresponds to an arrangement in which the axis of symmetry ol 
the field and the axis of the ellipsoid coincide. In excited states, in 
view of the discreteness of energy, the axis of the ellipsoid can as- 
sume only several selected orientations relative to the axis of symmetry 
of the field. The energies of these quantum states is given by the 
above formula. An electromagnetic wave impinging on a molecule 
is absorbed if the magnitude of the photon corresponds to the 
energy of transition from one orientation of the ellipsoid to 
another. 

The study of quadrupole spectra began quite recently. Such spec- 
tra are of great scientific interest as can be gauged by the fact that 
they enable us to measure frequencies with extremely high accuracy 
and that the quadrupole frequency will react to yery small changes 
in the electric field of the molecule of which the nucleus is a compo- 
nent as well as of neighbouring molecules. s 

Suffice it to say that quadrupole frequencies will differ percepti- 
bly in crystalline varieties of one and the same substance. Thus, 
a nucleus reacts not only to changes in the field due to close elec- 


trons, but also to changes in the field due to more distant elec- 
trons. 


Example. The electric 


quadrupole moment of a nucleus of Cl is Q = 
= — 0.07 X 107 cm?, 


Quadrupole resonance occurs te Cla when v = 
= 54.47 me/s. For a nucleus of CI85, the spin s is equal to =: This means that 


s 1 Duster 
the quantum number m assumes the values Ba J Deena eich ours “SINCE 
quadrupole interaction energy is a function of m?, when an electromagnetic 
energy quantum hv is absorbed only one transition is possible, i.e., a transition 


; 3 
from the level corresponding to | m| = z to the level corresponding to 


The resonance condition is 


hv = E; — êz =Q $ (ie (s+) (=) -3 (5 +1) ae ; 


A ae Sey 
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By measuring the resonance frequency v 


of the quadrupole moment Q of a C 
of the electric field due to electri 


and knowing from other data the value 
18> nucleus, one can determine the gradien 
ons at the centre of a Cl®* nucleus in a ©12 


molecule: 
OE 2hv 26.6 x 10-27 x 54.5 x 108 i ; 
| ee = 2.14 x 106 Ga é units. 
1=| 5z 20. E EE E a es tO Gaussian, 
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211. Nucleon Interaction Forces 


Our basic knowledge of nuclear forces can be obtained by studying 
the scattering of particles. It was concluded from Rutherford’s 
first experiments in the scattering of a-particles that nuclear forces 
have a very small range. Rutherford’s results could be explained 
quantitatively by assuming that the deflection of a-particles is due 
to electric repulsion between charged particles having the same sign. 
The experimental results agreed with the results of theoretical cal- 
culations even when an -particle passed extremely closely to the 
scattering nucleus. This means that it suffices to separate two nucle- 
ar particles by a very small distance in order for the effective forces 
to consist only of electric forces; the nuclear forces will then no 
longer be effective. 

More direct evidence can be obtained from the scattering of neu- 
trons by protons. For this purpose, a neutron beam is passed through 
hydrogen in a gaseous state. Experiments indicate that only a small 
fraction of the neutrons collide with nuclei of hydrogen atoms. The 
angular distribution of scattered neutrons is uniform. This result 
differs basically from that of -particle scattering, i.e., scattering 
due to electric interactions. In such scattering, deflection always 
occurs, but the deflection is small when an c-particle passes far from 
a nucleus and large when such a particle passes close to a nucleus. 
It can be concluded from the patterns obtained in the scattering of 
neutrons by protons that the effective range of nuclear forces is very 
small. A value of the order of 2 X 40- cm is reliably deduced from 
such experiments. - 

This is also the value obtained for the effective range from the scat- 
tering of protons by protons. In this case, the experiments and 
calculations are rather intricate since it is necessary to “deduct” that 
portion of the scattering due to purely electric interaction. However, 
the deduction required can be determined using data from observa- 
tions at-high energies and large angles. Unfortunately, direct experi- 
ments in the scattering of neutrons by neutrons are not possible, but 
considerable indirect evidence indicates that in this case too nuclear 


forces have the same properties. For example, let us compare the 
binding energies of tritium (hydrogen isotope of mass 3) and the 
helium isotope of equal mass. In the first case a nucleus consists of 
two neutrons and one proton and in the second of two protons and 
one neutron. It turns out that the binding energy of this helium 
isotope is less than that of tritium by an amount exactly equal to 
the electric interaction between two protons. 

From these experiments one can conclude that the nuclear forces 
acting between nucleons are independent of the electric charges of 


the interacting particles. 
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From experiments in the scattering of nucleons one can also con- 
clude that the interaction is of an exchange nature. The term exchange 
is used to indicate that the colliding particles interchange proper- 
ties, i.e., a proton is transformed into a neutron and vice versa. 
Experimentally, this has been shown to occur in the scattering by 
protons of a beam of neutrons having very high energies (scores of 
times greater than the potential energy of interaction between pro- 
tons and neutrons). One would expect that in such an experiment 
most neutrons pass through hydrogen without scattering, but in 
fact the forward scattered beam consists of equal numbers of 
neutrons and protons. 

The problem of nuclear forces is complicated by the fact that such 
forces depend on nuclear spin orientations. These orientations cause 
nuclear forces to lose their central nature, i.e., the forces do not act 
along the lines joining the centres of particles. 

Many difficulties have not yet been overcome and the nature of 
nuclear forces still remains unexplained. It is quite probable, how- 
ever, that the meson theory of the Japanese physicist Yukawa will 
continue to form the basis of the theory of nuclear forces. This 
theory will be discussed in Sec. 244. 


212. Nucleons in a Nucleus 


Nucleons are very closely spaced in a nucleus. The following for- 
mula for the “radius” of a nucleus is in reasonably close agreement 
with a number of experimental facts: 


R=kyM. 

Here, M is the mass number and k = 1.5 x 10-1 cm. The formula 
has been established for heavy nuclei, but it is undoubtedly valid 
for light nuclei as well. Since the radius is proportional to the cube 
root of the number of particles in a nucleus, the volume is propor- 
tional to the first power of this number. Therefore, it must be assumed 
that, at least to a rough approximation, the packing density of 
nucleons in a nucleus is uniform. 

Since nucleons are of a wave nature, one cannot, of course, con- 
struct a geometric model of an atomic nucleus and determine the 
path of nucleons in a nucleus. . 

As an approximation, each nucleon can be pictured as moving in 
the field of all the others. Such a nucleon will have a system of 
energy levels which can be filled consecutively in going from light 
nuclei to heavier ones. Like in the case of electrons, the lowest level 
cannot have an angular momentum. In accordance with the Pauli 
exclusion principle, at this lowest level we can have two neutrons 
and two protons (the particles of each pair having opposite spins), 
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i.e., an c-particle. An analysis of heavier nuclei shows that stable 
groups of particles will also be located at the other levels. The shell 
model of a nucleus has proved very useful in determining a number 
of nuclear properties and in explaining the prevalence of various 
isotopes. 

Tt will be noted in examining the composition of atomic nuclei 
that as we proceed from light particles to heavier ones the number 
of neutrons in an atomic nucleus increases more rapidly than the 
number of protons. Thus, in the case of the very stable element lead, 
there are 82 protons and 126 neutrons. This increase in the number 
of neutrons can be explained by the need to counterbalance the 
increased electric repulsion of the protons. 

From another viewpoint, of course, the lack of equality between 
protons and neutrons is disadvantageous. ‘When this equality exist- 
ed, the low energy levels could be filled with a maximum number 
of particles since, in accordance with the Pauli exclusion principle, 
two neutrons and two protons can exist in one state. However, if 
this occurred, the electric repulsion would increase too much and 
the total energy would not be a minimum. Evidently, an actual 
case represents a compromise solution between these two tenden- 
cies. Beta decay phenomena, which occur so frequently in artificial- 
ly radioactive elements, represent the selection of the optimum situa- 
tion in the indicated sense. If there are too many protons or too 
many neutrons in a nucleus, this is corrected by the emission of an 
electron or a positron. 

Like in the case of an atom, a nucleus usually occupies the lowest 
energy level, but it can exist in an excited state. It has been known 
for a long time that radioactive decay is often accompanied by elec- 
tromagnetic radiation. The reason for this is that after emitting an 
a-particle a nucleus remains in an excited state and then passes to 
a low energy level by emitting a photon of very high frequency. 
The latter emission is known as y-radiation. A nucleus can also pass 
into an excited state by colliding with another particle. A study of 
the transition energies of a nucleus enables us to construct its system 


of energy levels. 
213. Interaction of Fast Electrons 


If electrons move slowly, the forces of interaction are determined 
arges at the instant of interaction. There- 


by the arrangement of the ch c 
fore, in the case of slow electrons, the fact that an electromagnetic 


field exists is of no significance, i.e., it is unimportant that the 
interaction is transmitted by means of a field. 

The situation is different in the case of fast particles. Here, it is 
necessary to take into account the lag in interaction due to the finite 
velocity of propagation of an electromagnetic field. Interaction at 
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the instant ¢ is determined by the arrangement of the charges at the 
instant t— —. Now, the field cannot be dispensed with. How can the 
C 


quantum nature of a field be taken into account while maintaining 
the particle-field-particle interaction scheme? Such questions are 
dealt with in quantum electrodynamics, a field of theoretical phys- 
ics still being developed. Experimental evidence compels us to 
assume, in considering the interaction of fast particles that such 
interaction consists in the transfer of an energy quantum to the 
field by a particle and the subsequent transfer of this quantum to 
a second particle. 

If the magnitude of the transferred quantum of energy is equal to 
&@ and the time required to transfer this quantum from particle to 
particle is equal to t, then according to the principle of uncertain- 
ty (see p. 479). 


Gt~h. 


If the particles are close to each other, t is small and the particles 
can exchange quanta hv of low and high frequencies. As the distance 
between particles increases, t increases and energy exchange can 
occur only by means of small quanta. Using such reasoning, one 
a derive a formula for the force of interaction between par- 
ticles. 

Many authors describe the quantum interaction of particles through 
the medium of a field in the following picturesque terms: In trans- 
ferring a quantum of energy & to a field, a particle “lends” this ener- 
gy for a short period of time 7. The principle of uncertainty defines 
the dependence of the lending time on the amount of loaned energy. 
It is as if the law of conservation of energy were “violated” for the 
period of time T, i.e., the lending time. The principle of uncertain- 
ty indicates the permissible time interval during which the law of 
conservation of energy may be “violated” in order for the “violation” 
to be physically meaningless. 

This viewpoint may be developed further, but it leads to the fol- 
lowing difficulty. Since there is no limitation in the absorption and 
radiation of photons, infinitely large changes in the value of a parti- 
cle’s intrinsic energy occur when photons are exchanged. 

It is interesting that this occurs in all approaches to the evalua- 
tion of the intrinsic energy of an electron or other charged particle. 
It was seen on p. 243 that a point particle possesses infinite energy: 
At the same time, it is impermissible to assume that the particle 
has finite dimensions since a perfectly solid particle having finite 
dimensions cannot exist according to the theory of relativity. (The 
existence of perfectly solid bodies is inconsistent with this theory 
since interaction would be propagated instantaneously through such 
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bodies.) But if an electron is a deformable particle, what is its struc- 
ture? A solution to this problem has not yet been found. 

A distinctive feature of the new theory is that, in spite of the fact 
that it is based on contradictions and lacks logical harmony, it 
leads to a number of new, very interesting results. 


214. Meson Theory of Nuclear Interaction 


In the preceding article, we stated that interaction between elec- 
trically charged particles occurs through the medium of an electro- 
magnetic wave by means of quanta. An electrically charged particle 
transmits a quantum of energy to an electromagnetic field and this 
energy is then transmitted to another particle. If it is assumed that 
a field is associated with nuclear forces and that this field is also of 
a quantum nature, interaction between nucleons can be described 
as follows: Each nucleon is surrounded by a field; a nucleon transmits 
a quantum of energy to the field and the field transfers this energy 
to another nucleon. 

A theoretical study to determine whether such an explanation of 
nuclear forces is permissible was undertaken by Yukawa. It turned 
out that a theory could be developed if it is assumed that the field, 
through the medium of which nucleons interact, possesses quanta 
having a rest mass that is not equal to zero. Interaction between 
nucleons was reduced in this manner to an exchange of particles 
having a mass m40. For reasons to be explained below, such parti- 
cles became known as mesons. A meson is a quantum of the mesonic 
field which surrounds nucleons. 

Now, let us examine several conclusions to be drawn from the 
theory. First, we shall try to estimate the range of nuclear 
forces. 

The energy of a meson which is transferred by a nucleon during 
interaction cannot be less than moc?, where mo is~the rest mass of 
the meson. The order of magnitude of the time taken to transfer 


h 3 5 
a meson is not greater than -zz (on the basis of the relation ĝt ~h, 
oc? 


y of transferred energy and t is the time taken 


where @ is the quantit l 
i elocity of a meson is less than c, 


to transfer this quantum). Since the v 
. t 

a meson cannot be transferred over a distance greater than mo 

Thus, the value of this constant quantity should give us the range 


of nuclear forces. : 
Such is the conclusion drawn by Yukawa. Using the range of nucle- 


ar forces known at that time, Yukawa showed that theoretical and 
experimental results coincide when mo, the rest mass of a meson, 
is 200-300 times greater than the mass of an electron. 
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Positive, negative and neutral mesons have equal validity in 
this theory. Thus, all interactions between nucleons can be easily 
integrated within the framework of one theory. Designating Yuka- 
wa’s meson by x and using a superscript to indicate the sign of the 
meson, one can express the interaction between two neutrons or two 
protons as an exchange process involving a neutral meson: 


pept; nen H. 


An exchange interaction between a proton and a neutron is a prot- 
ess involving a positive or a negative meson: 


> +e > - 
Penti; nept+r. 


Yukawa’s theory was developed before the discovery of mesons. 
At present, the above interaction formulas no longer represent theo- 
retical predictions, but rather expressions for phenomena which 
have been actually observed. i 


215. Mesons 


The term “meson” (from the Greek word “mesos”, meaning average 
or intermediate) was coined to indicate that the mass of such a par- 
ticle lies between that of an electron and that of a proton. Experi- 
ments show that several kinds of mesons exist. Mesons (electrically 
charged) were first detected in cosmic rays. Now, mesons are created 
in accelerators—they arise when nucleons collide. 

However, not all mesons play the same role in interactions between 
nucleons. The Yukawa meson is a s-meson. As has been indicated, 
positive, negative and neutral x-mesons exist. Recent measure- 
ments indicate that the mass of a x-meson is equal to 273 me. 
where me is the mass of an electron. 

The mesons which were first found in cosmic rays are m-mesons 
(positive and negative). The mass of a p-meson is equal to 207 me. 
Ten years elapsed after the discovery of -mesons before it was shown 
that such mesons are products of a-meson decay. The reason why 
investigators were able to detect u-mesons but were unable to 
detect s-mesons lies in the different lifetime of these particles. The 
average lifetime of a -meson is about 107° sec, but that of a «-meson 
is about a hundredth of this value, i.e., of the order of 4078 sec. 

Such transformations have been photographed numerous times. 
If we follow along a p-meson track on such a photograph, we may 
find that the velocity of the particle decreases (easily recognised by 
the change in the thickness of the particle track: the slower the par- 

ticle the heavier the track since a slow particle creates more ions 
along its path than a fast one). At the point where the track ends, 
another track begins. Calculations indicate that the new particle 
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is of very high energy, which may be explained only if we assume 
that a part of the rest energy of a t-meson has been transformed into 
kinetic energy. In addition, in order to satisfy the law of conserva- 
tion of momentum, one must assume that a neutral particle of very 
small mass is created when a m-meson is transformed into a p-meson. 
Thus, the neutrino is again encountered. 

The rest mass of a x-meson is of the order of 150 Mev. Therefore, 
a meson may be produced by nuclear bombardment with projec- 
tiles having an energy greater than this value. Actually, an energy 
greater than 300 Mey is required. Nuclear bombardment with high- 
energy particles results in the creation of a large number of mesons. 

Charged and neutral mesons have been created under laboratory 
conditions. In the case of neutral mesons, this is particularly diffi- 
cult since the lifetime of such a particle is 10-*® sec. This means that 
the particle traverses a distance of only a thousandth of a milli- 


metre during its lifetime. 


216. Relativistie Theory of an Electron 


When the Schrödinger equation was discussed in Sec. 180, we did 
not use the relations of the theory of relativity since we assumed 
that the velocities of the particles were much less than the velocity 
of light. 

We cannot dispense with the relativistic corrections when dealing 
with high-energy electrons, i.e., electrons having energies of the 
order of millions of electron-volts. Such high-energy electrons are 
encountered in radioactive radiation, in X-ray tubes the operating 
voltages of which are equal to several million volts, in betatrons, etc. 

To correctly describe electrons having such velocities by a wave 
function, one must take into account the relationship between ener- 
gy and momentum given by the theory of relativity. 

If the rest mass of a body is given, the energy and momentum of 
the body can be directly determined from the velocity of the body. 
Therefore, energy and momentum are related. Squaring the expres- 
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ê moe mov 
Z= and p=—=— 
e Vi—pF PT VISE" 


and then subtracting, we obtain 


This relationship can be written in the following form: 


E= V PP me 


= 
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(see Fig. 242). When p = 0, € = + moc. This means that the coor- 
dinates of the points at which the two branches of the curve intersect 
the axis represent equal rest energies of the particle. . 

For a long time, no attention was paid to the negative branch of 
the curve. It seemed impossible that particles could exist which 
obeyed the laws of this branch. Indeed, a particle having negative rest 
energy and the energy of which decreases as the momentum increases 
must behave very strangely. It is like a particle having negative 
mass, which means that when a force is exerted on such an “electron 
the particle is accelerated in the opposite 
direction to that of the force. Thus, if 
we try to attract such a particle, it is 
repelled. Imagine that we have two close 
electrons—an ordinary one and an extra- 
ordinary one, i.e., one which obeys the 
laws of the lower curve. Electrons should 
repel each other. Therefore, the ordinary 
electron will tend to move away from 
the extraordinary one. But since the latter 
“electron” will be attracted under the 
action of the force of repulsion it will 
move toward the ordinary electron. AS 


a result, this pair will be accelerated 
jointly, whereby the increase in positive kinetic energy of the 


ordinary electron will be counterbalanced by the increase in 
negative energy of the extraordinary particle. 

Does such a particle exist? First, it should be noted that from the 
viewpoint of quantum mechanics this is not another particle at all. 
The negative branch of the curve of energy as a function of momen- 
tum should be interpreted as giving the lowest energy level of one 
and the same particle, namely, an electron. An electron may be 
located at an ordinary energy level or at a negative energy level. 
Like in the other cases, transition to the lowest energy level should 
be accompanied by energy radiation. 

To be sure, in such a transition the radiation cannot be limited to 
a single photon. This may be explained as follows. When an electron 
passes from an ĝ4; pı state of the upper curve to an ĝz, px state of 
the lower curve, the energy released is 


Fig. 242 


i — 62 = c0 (V RF mic? -+V Fme), 


i.e., more than c (pı + pə). If this energy were transferred to a sin- 
gle photon, then we should have 6, — 2 = hv and, therefore, pa + 


jab a i.e., the change in the momentum of the photon (g İS 
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anti-parallel to Po) would equal = But G; — &2 is greater than 
c (pı + po), not equal to it. In the case of a single photon, it is not 
possible to satisly simultaneously the law of conservation of energy 
and the law of conservation of momentum, but this is always pos- 
sible if two photons are emitted. : 

We should not be disturbed by the fact that the lower state corre- 
sponds to negative energy and that particles with 6 < 0 behave very 
strangely. The difficulty lies elsewhere: Since there is limitless room 
at the lower level in view of the fact that it extends to all possible 
values of momentum, why do not all ordinary electrons pass over 
to this level? The proposed model accounts for the existence of only 
“extraordinary” electrons. To accord with this model, an ordinary 
electron should have a short lifetime, much like an excited atom. 


217. Creation and Annihilation of Pairs of Particles 


A relativistic theory of the electron was developed by Dirac in 
4928. The description of an electron in this theory differs considera- 
bly from that in the Schrédinger theory. Four wave functions are used 
to describe the behaviour of an electron since one wave function no 
longer suffices. The existence of electron spin does not follow from 
the Schrodinger theory, but electron spin is a necessary consequence 
of the Dirac theory. The success of this theory in describing numerous 
phenomena speaks for the validity of its basic concepts. Here, we 
shall consider in the light of this theory only the contradiction dis- 
cussed in the preceding article. How can we explain the fact that 
a tremendous number of ordinary electrons exist? How can we 
explain the “reluctance” of these electrons to pass over to the lower, 
negative energy level? 

The Pauli exclusion principle does not allow an electron to pass 
over to a lower level if that level is occupied by two electrons having 
opposite spins. Let us apply this principle to our problem. The solu- 
tion will be obtained if we assume that all negative energy states are 
occupied. Does this mean that all space is completely filled with elec- 
trons having a negative energy state? This conclusion appears ines- 
capable. The Dirac theory has led to a new view of vacuum. In this 
theory, vacuum acquires physical properties, i.e., it is filled with 
electrons in a negative energy state. Moreover, it is filled boundlessly 
since the decrease in energy can extend to negative infinity. 

Let us determine which phenomena can be explained and predict- 
ed by this theory. Tf all negative energy states are filled, their exist- 
ence may be detected only when an electron passes from a negative 
energy level to a positive energy level after receiving a significant 
portion of energy—in any case, not less than 2 moc?. Such a process 
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may involve the expenditure of two photons. Another possibility 
is the following: A photon passing close to a heavy atomic nucleus, 
which has a strong electric field, may give up its energy to raise an 
electron from a negative to a positive level. The role of the atomic 
nucleus consists in providing the necessary momentum. 
In both cases an electron is “created”. But in addition to the crea- 
tion of an electron, a “hole” is produced in the negative energy 
states. The removal of an electron, i.e., a negative charge, means that 
- the hole acquires a positive charge of equal magnitude. On the other 
hand, the absence of a particle having negative energy signifies that 
the energy has increased. Therefore, the greater the momentum of 
a hole, the greater its energy. The inescapable conclusion is that 
“holes” behave like positively charged electrons having positive 
energy. Except for the sign of the charge, the behaviour and laws of 


motion of a positive electron (positron) in no way differ from those 
of an ordinary electron. 


A positron and an electron are “created” 
of the energy of photons. The reverse proc 
also possible. This consists in the transformation of a colliding elec- 
tron and positron into two photons or, if annihilation occurs close 
to a heavy atomic nucleus, into a single photon. 

It is understandable why positrons have a short lifetime: they are 


attracted by electrons and disappear upon collision. But why do not 


electrons disappear? The reason is simply ‘that, there is an excess 
of them. 


Do systems exist in 
which electrons areu 


as a pair at the expense 
ess, i.e., annihilation, is 


which there are an excess of positrons, i.e., in 
nstable? Such a supposition isnot at all ridiculous. 
Pair creation and annihilation are processes which can be easily 
observed in large numbers under laboratory conditions. When 
y-rays having an energy greater than 1 Mev pass through a thin 
metal foil, the electrons which are ejected from the foil will be deflect- 
ed in a magnetic field in opposite directions. By tracing the path of 
a positron (means of observing charged particles are discussed in 
Sec. 204), one can determine the point at which the particle disap- 
pears. This is where the positron combined with an electron. Using 
modern photon counter devices, one can show that two oppositely 


directed photons, each having an energy of the order of % 


Chas 
Bae ul- 
z Sim 
taneously emerge from this spot. 


218. Particles and Anti-Particles 
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ason for supposing that the existence of the oe 
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the theory of interaction between nucleons, it is basically similar 
to the theory of interaction between electrons. In most theoretical 
studies, nucleons are considered to be described by equations which 
are quite similar to the Dirac equations for electrons. It was to be 
expected, therefore, that nucleons have anti-particles which stand 
in the same relationship to the proton and neutron as a positron does: 
to an electron. The first of such anti-nucleons to be discovered was 
the anti-proton. Somewhat later the anti-neutron, the magnetic 
moment orientation of which differs from that of the neutron, was 
discovered. (The magnetic moment and the angular momentum 
vector are antiparallel in the case of the neutron and;parallel in 
the case of the anti-neutron.) 

The discovery of the anti-proton proved that the concept of an 
inseparable link between field and particles is well founded. Like in 
the case of a positron-electron pair, a proton-antiproton pair may be 
created by the passage of a particle (a nucleon) from a negative ener- 
gy state to a positive energy state. For this purpose, an energy of at 
least 2Mc? is required. This is a tremendous amount of energy— 
1,840 times the energy required to create an electron-positron pair. 
An accelerator which could accelerate particles to billions of elec- 
tronvolts had to be constructed before it was possible to discoyer 
the anti-proton. 

If a proton and an anti-proton collide, they will be annihilated. 
Since nucleons transfer energy through the medium of a meson field, 
when they are annihilated their mass and energy are given up to 
quanta of this field, i.e., to mesons. This process will undoubtedly 
be subjected to careful study in the years ahead. 

Fig. 243 shows a photograph of the annihilation of a proton and 
an anti-proton. This process occurred in a bubble chamber filled 
with liquid propane. A sketch of the process is given in the upper 
left-hand corner. 

The reasoning used in explaining why anti-particles must exist 
extends to the anti-neutrino as well. An anti-neutrino is the “mirror” 
image of a neutrino. The difference between the particles composing 
this doublet is the same as in the case of the neutron-antineutron 
doublet. j s 

Mu mesons and other elementary particles which have not been 
discussed are also encountered as doublets. ` 

Pi-mesons constitute a triplet: thére is a m-meson having a posi- 
tive charge, another having a negative charge and still another having 
zero charge. In contradistinction to the neutron and the neutrino, 
the neutral «-meson has no spin and, therefore, can have no anti- 
particle (or expressed differently: the neutral s-meson coincides 
with its anti-particle). The photon is another particle which has no 
“mirror image”. 
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219. Asymmetry of Elementary Particles 


Nucleon-nucleon interactions occur with the emission and absorp- 
tion of a-mesons. This is the strongest of the various interactions 
between elementary particles and is responsible for the force binding 
nucleons in an atomic nucleus; its duration is 10-** sec. Nuclear 
forces are about a hundred times as powerful as electromagnetic 
forces; the duration of electromagnetic interaction, which occurs: 
by the exchange of photons, is 107° sec. These two types of interac- 
tion are known as strong interactions, in contradistinction to the 
weak interactions occurring in particle transformations in whick 
neutrinos take part. Examples of weak interactions are the transfor- 
mation of a neutron into a proton, accompanied by the emission of 
a neutrino and an electron (see B-decay), and the decay of m- and 
[i-mesons: oo 

m—>p-+v and p—-e+v+v. 


The duration of weak interactions is of the order of 10-* sec. Nuclear 
forces are about 10" times as powerful as weak interaction forces. 
In calculating the interaction force from the duration of the process, 
we are assuming, of course, that other conditions remain equal. 
Some. extremely interesting discoveries have been made recently 
in connection with weak interactions. It has been found that weak 
processes exhibit “left-right” asymmetry. Thus, for example, in 
B-decay of cobalt nuclei polarised at low temperatures by means of 
a magnetic field (polarisation of the particles orients their magnetic 
moments and spins in a definite direction), the angular distribution 
of electrons is asymmetrical with respect to the “forward” and “back- 
ward” directions. Similarly, in -meson decay, asymmetry with re- 
spect to the direction of motion of the particles has been detected. 
A theory for this phenomenon was proposed by Lee and Yang, and 
by the Soviet physicist Landau. Two explanations are possible: 
either the particles are internally asymmetrical or space is asymmet- 
rical. We shall restrict ourselves to the first explanation. Its essence 
lies in the assumption that elementary particles are like screws as 
regards their symmetry properties. Such asymmetrical particles are 
well known to the physicist. They include, for example, the left and 
right optical antipodes of ilena: Coa P E ahah a 
: ny, , rticle the axi i 
asymmetrical clemontary Bae in the “forward” and “backward” 
oriented has different properties 7 il 
directi Thi heen proved experimentally, ; ; 
I ctions. This has 1 : mmetry observed in experiments with 
ee order to explain t Be A Rg the Landau hypothesis, which re- 
ms of p-mesons, one Ce As indicated in the 
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encountered in nature as charged pairs. Landau proposed that if 
the symmetry properties of a particle are like those of a right-hand- 
ed screw, the properties of its anti-particle are like those of a left- 
handed screw. Reflection in a mirror makes a right hand appear like 
a left hand and a right-handed screw like a left-handed screw. Thus, 
the “mirror image” of a particle is its anti-particle. 

What application has this hypothesis to experiments with beams 
of u-mesons? It can be proved that a particle having no mass must 
be oriented with its spin in the direction of motion. The mass of 
a neutrino is evidently equal to zero. Therefore, neutrinos are “lon- 
gitudinally polarised”. The difference between a neutrino and an 
anti-neutrino can be reduced to the following: the spin orientation 
of a neutrino is in its direction of motion, while that of an anti- 
neutrino is in the opposite direction. Mu mesons are formed when 
st-mesons decay. Since the spin of ast-meson is equal to zero, the spin 
of a p-meson must be parallel to the spin of a neutrino. Therefore, 
the u-mesons formed in a x-meson beam are longitudinal. This 
explains the asymmetry observed in the distribution of electrons 
during theysubsequent decay of these 1-mesons. 

Investigation ofthe asymmetry of elementary particles may result 


in a significant revision of a number of fundamental concepts in 
physics. 


d 


CHAPTER XXXI 


NUCLEAR TRANSFORMATIONS 


220. General Laws of Chemical and Nuclear Transformations 


Now, we shall discuss several energy relations which are equally 
applicable to chemical reactions and to transformations of atomic 
nuclei and other particles. 

Transformations can occur only when particles closely approach 
each other. Since particles must possess a certain kinetic energy in 
order for a transformation to occur we are 
quite justified in using the term “colli- U 
sion” to describe a close approach between 
particles. Not every encounter between 
particles results in a transformation. The 
mechanism of chemical and nuclear trans- 
formations is very difficult to study. 
Since direct observations are impossible, 
one is forced to make hypotheses the 
validity of which can be checked indi- 
rectly. 

In the case of chemical transformations, 
the mutual orientation of molecules upon 

. collision undoubtedly plays an important 
role. In order for a reaction to occur, molecules must approach 
each other in a manner suitable for the regrouping of atoms. 

For every transformation which can occur on a mass scale—the 
usual case in chemical and nuclear reactions where billions of mole- 
cules or nuclei collide during a short interval of time—one can 
indicate, in principle, the number A which, generally speaking, will 
be a measure of the fraction of encounters in which particles are in 
a position “suitable” for a transformation to occur. 

However, the requirement of appropriate orientation is, of course, 
not the only condition to be met in order for a transformation to 
occur. Since a particle is ordinarily stable and, hence, possesses 
a minimum of potential energy, an energy sufficient to lift the mole- 
cule out of its potential well must be imparted to it. This minimum 
necessary energy is called the activation energy. Fig. 244 shows a poten- 
tial energy curve. The particle is stable when r = 0, where r is 
a fixed parameter. In order for a reaction to occur, an activation 


ate 
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Fig. 244 
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energy @ must be provided. For the case} represented in the figure, 
the reaction proceeds with a liberation of heat. 

We can apply the Boltzmann law to collisions between molecules, 
or between nuclei (under similar conditions), i.e., we can assume 
that the number of encounters resulting in a transformation is pro- 
portional to e~6/kT, where @ is the activation energy. 

It is evident that the rate of transformation may be expressed as 
the product Ae~@/*T, where the first factor takes into account the “geo- 
metric” conditions of encounter and the second the energy aspect. 

It is customary to give special consideration to the case of two 
colliding particles at the instant when the potential energy is a maxi- 
mum. Such an activated complex (as it is referred to in chemistry) 
or compound nucleus (as it is re- 
ferred to in the study of nuclear 
transformations) does not exist 
for a very long period of time. 
The system may “slip back” into 
the potential well or “roll over’ 
the well wall. In the latter case, 
a transformation has occurred 
and a new system with a new 
Absorbed potential energy has been 

heat formed. 

In chemical as well as nuclear 
transformations, the resulting 
system may consist of a single 

Fig. 245 new particle (addition reaction) 
or two new particles. 

If the potential energy of the new particles is greater than the 
potential energy of the original particles (cf: the bottom of a vol- 
cano’s crater is below the level of the foot of the mountain on which 
the voleano is located), the transformation proceeds with the absorp- 
tion of energy. The absorbed energy (heat) will be equal to the differ- 
ence between the activation energy and the energy of the reaction 
products (see Fig. 245). If the energy of the created particles is less 
than the energy of the original particles, heat is liberated. 

Both kinds of transformations are encountered in chemistry and 
in nuclear physics. Reactions proceeding with a liberation of heat 
are called exothermic, while those proceeding with an absorption 
of heat are called endothermic. 

Chemical and nuclear transformations are often accompanied by 
radiation. However, as a rule, the main energy effect of a reaction 
consists in the transformation of the potential energy of the atoms 10 
a molecule (or nucleons in a nucleus) into kinetic energy. Therefore, 
generally speaking, transformations in which heat is liberated are 
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those in which two slow particles collide and produce two fast ones. 
Of course, in endothermic reactions, the opposite takes place. 

It is seen from the formula that the rate of transformation increases 
exponentially with temperature. This is the reason why chemical 
transformations are extremely sensitive to changes in temperature. 
The higher the temperature, the greater the impact of colliding par- 
ticles. It is well known that temperature plays an important role in 
chemical transformations. In nuclear transformations, as a result of 
the tremendous values of binding energy, the role of temperature 
change is not so noticeable. The activation energy of atomic nuclei 
has an order of magnitude of several Mey, but if, for example, the 
temperature is increased to 3,000°, the energy of an atomic nucleus 
increases by only 0.4 ev. 

The temperature must be increased to millions of degrees rather 
than to thousands if we wish to accelerate nuclear transformations 
(see below). 


221. Nuclear Reactions 


A mass of experimental data has been accumulated on nuclear 
reactions. The number of such reactions which have been studied 
reaches several thousand. 

At present, the following types of nuclear reactions are known 
(in addition to radioactive decay, which may be viewed as a nuclear 
decomposition reaction): capture reactions, in which two colliding 
particles combine; exchange reactions, in which a particle is cap- 
tured and another is ejected; and fission reactions, in which a nucleus 
breaks up as the result of energy received in one or another form. 
Nuclear reactions which occur under the action of hard y-rays are 
known as photonuclear reactions. 

By means of nuclear reactions, one can obtain stable natural iso- 
topes as well as unstable radioactive isotopes which are not encoun- 
tered in nature. It has proved possible to synthesise elements which 
have no stable isotopes (for example, technetium, an element having 
the atomic number 43), and also transuranic elements. 

The reactions which occur when various nuclei are bombarded with 
a-particles, protons and neutrons have been studied most care- 
fully. 

When an @-particle collides with a nucleus, one of two types of, 
reactions generally occurs: either the a-particle is captured and a pro- 
ton (p) is ejected or the a-particle is captured and a neutron (n) is 
ejected. These reactions are designated by the symbols (a, p) and 
(a, n), respectively. j 

The equation for an (a, p) reaction has the form 


zE“ +204 — gE? + yp. 


=" 
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The equation for an (a, n) reaction has the form 
BERE he ay Of ee TS nh 
Here are some examples of a-particle reactions: 
(@, p) : 7N™* + 204 — 0” + jp}; 
Al + 204 —> Si + yp}; 
(a, n) : ,Be® + .a4 — C£ -+ on}; 


sB10 + pat —> N13 4 ont. 


The first of the above (œ, n) reactions is of great practical impor- 
tance since a mixture of radium («-particle source) and beryllium 
is a common neutron source. 


A large class of reactions occur as the result of collisions with pro- 


tons. Such reactions include (p, œ) reactions (in which a proton is 
captured and an o-particle is ejected), e.g., 


aF + spt —> 016 +2Het, 
and (p, n) reactions (in which the ejected particles are neu- 


trons). 


Capture reactions which are not accompanied by the ejection of 
a particle also occur. The excess energy is released in the form of 
y-rays. Therefore, such reactions are designated as (p, y) reactions, 
e.g., 

3Li’ + ;p'—> ,Be®, 

Reactions involving deuterons (d), e.g., (d, p) and (d, n) reactions, 
have been subjected to careful study. When heavy water (deuterons) 
is bombarded with deuterons, a radioactive isotope of hydrogen, 
viz., tritium, may be formed: 

1D? +D? —> H? +p. 
However, a (d, n) reaction may also occur: 
1D? +D? — He? + ont. 


Reactions involving neutrons are of great importance in nuclear 
engineering since they occur abundantly in nuclear reactors. Such 
reactions include (n, œ), (n, p), (n, 2n) and (n, y) reactions. In addi- 
tion, reactions involving the fission of heavy nuclei occur under 
the action of neutrons (see below). 

Two kinds of reactions occur between nitrogen and neutrons: 


(n, p) : IN + ont — 0 + pts 
(n, 2n) : ;N™ + ont —> „N! + Ion, 
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The first of these reactions yields a carbon isotope of long lifetime 
(more than 5,000 years). This isotope is of great importance in bio- 
chemical investigations. 

Almost all isotopes are capable of capturing a neutron [(n, y) reac- 
tion]. In this manner, an isotope of the same element, the mass of 
which is one unit greater than the original isotope, is formed. Usual- 
ly radioactive isotopes (f-radioactivity) are produced. 

Exothermic and endothermic reactions are possible in nuclear 
chemistry, like in molecular chemistry. The magnitude and sign 
of the thermal effect can be determined using the principle of the 
equivalence of mass and energy. 

If nuclei of masses M, and M, are formed from nuclei of masses 
M, and Mo, then 

M, + M =M; +M, + Am. 


When Am > 0, i.e., when the nuclei of the reaction products have 
less mass than the original nuclei, the reaction is exothermic. When 
Am < 0, i.e., when the nuclei of the reaction products have greater 
mass than the original nuclei, the reaction is endothermic. The 
energy released or absorbed during a reaction can be determined 
from the formula @ = c2Am. Calculated and experimental results 
are in perfect agreement in all cases. 

In most nuclear reactions, the thermal effects are of the order of 
millions of electron-volts for each pair of reacting nuclei. This is 
millions of times greater than the corresponding values in chemical 


reactions. 


Sample calculation. Let us consider the reaction 
,Be® + pt — Lit + 204. 


The following are tabulated handbook values of the masses occurring in this 
reaction: mpeg = 9.01503, mp = 1.00812, mpi = 6.01697 and mg = 4.00390. 
The reaction is exothermic since Am = 0.00228, i.e., Am > 0. Since an atomic 
weight unit corresponds to 931.8 Mev, the thermal effect is equal to 0.00228 x 
Xx 931.8 = 2.12 Mev or about 8X 410714 calories (1 Mev = 3.827 X 


x 10714 calories). 
222. Fission Reactions of Heavy Nuclei 


Atomic nuclei which are heavier than tin nuclei are capable of 
splitting into two parts of approximately equal mass. In order for 
this fission to occur, a considerable amount of activation energy 
must be imparted to such a nucleus (the greater the isotopic mass 
number the smaller the activation energy). For nuclei of uranium, 
thorium and palladium, this energy is equal to about 5 Mev. The 
fission reactions of such nuclei are exothermic; the energy released 


considerably exceeds the activation energy. 
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Fission may be caused by protons, deuterons, «-particles or y-ra- 
diation. However, fission caused by a neutron hitting a heavy 
nucleus is of paramount significance. 


Fission-product yield,% 
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Fig. 246 


Let us consider fission of uranium-235 under the action of neu- 
trons. When a neutron is captured, a uranium isotope with a mass 


223. Chain Reactions 573 


one unit greater than U? is formed: 
o2U235 + pret —> 9 U8. 


Fission occurs if the activation energy of U?® is less than the energy 
supplied by the neutron. The captured neutron brings to the nucleus 
an energy equal to the binding energy of the neutron plus the kinet- 
ic energy of its motion. In the case of U5, the energy supplied 
by a neutron of thermal velocity is sufficient to produce fission. 

Since a large amount of energy is required to split a nucleus of 
U8, the nucleus of this isotope can be split only by means of fast 
neutrons. 

A nucleus of U? generally divides into two unequal fragments. 
Nuclei split randomly and yield various primary products. Fig. 246 
shows the yield of uranium-235 fission products as a function of 
mass. It is seen from the curve that division into equal fragments 
has a minimum probability. Most frequently the masses of the frag- 
ments stand in a ratio of about 2 to 3 (e.g., Sr’ and Xel8*). These 
nuclei have enormous energies—a light nucleus has an energy of 
about 100 Mev and a heavy nucleus 65 Mev. The large variety of 
fission products is due not only to the fact that a nucleus may split 
in numerous ways, but also to the fact that a number of other pro- 
ducts are created when radioactive fragment nuclei decay. 

Nuclear fission products have an abnormally large number of 
neutrons. By means of a chain of transformations, radioactive nuclei 
assume a normal stable state. For example, 
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The half-life is indicated under the arrows. It is evident from the 
values of decay time that, practically speaking, only Cs??? is obtained 
as the product of this chain of transformations. 

Another very important fission product of uranium-230 is stron- 
tium-90 (see below). 

Nuclear fission of uranium-235 is accompanied by the release of 
an enormous amount of energy. Thus, 4 gram of uranium yields as 
much energy as the combustion of 2.5 tons of coal, i.e., 22,000 kwhr. 
Most of the energy is released in the form of kinetic energy of fission 
fragments and about 40% is released in the form of radiation. 


223. Chain Reactions 


All aspects of the uranium-235 reaction have been studied in 
detail. Uranium-235 is the only natural isotope from which energy 
can be obtained on an industrial scale. i 

Since neutrons are required to achieve fission and a neutron gas 
does not exist in nature, a large amount of energy can be obtained 
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only by means of a chain reaction in which new neutrons are contin- 
uously created. This is precisely the method used to achieve nuclear 
fission of uranium-235. Every time a U-235 nucleus is split several 
neutrons are liberated. The number of such neutrons varies from 
case to case, but on the average 2.5 neutrons per fission are obtained. 
If the neutrons formed when a nucleus splits are able to cause 
fission of other uranium atoms, a chain reaction may occur. : 

Since neutrons have a long range, the neutrons formed upon fis- 
sion of a uranium-236 nucleus have a large probability of escaping 
from the substance without splitting other nuclei. Moreover, it 
should be realised that not every encounter between a neutron and 
a uranium-235 nucleus results in fission. 

The growth of a chain reaction is a function of the neutron mul- 
tiplication factor. The value of this quantity (Ko) can be calculated 
for the case of a system having infinite dimensions. To perform this 
calculation, one must know the neutron multiplication due to fission 
by slow and fast neutrons and also the probability of neutron cap- 
ture by nuclei of nonfissionable materials. 

The extent to which Ky exceeds unity is a measure of the increase 
in the number of neutrons of a given generation over the preceding 
generation. 

However, a reactor has finite dimensions. Therefore, the neutron 
multiplication factor must be written in the form K = Ko (1 — p), 
where p is the probability that a neutron will escape from the reactor. 
In order for the reactor to operate, Ko must be greater than unity. 
During reactor operation, the multiplication factor K must be 
exactly equal to unity. : 

The dimensions of a system containing nuclear fuel are said to 
be critical when the multiplication factor of the system is equal 
to unity. 

Let us examine the factors which affect K. The probability of 
a neutron colliding with another nucleus before escaping from the 
substance may be increased by amassing a large quantity of nuclear 
fuel. The same end is achieved by reducing to a minimum the num- 
ber of atomic nuclei capable of absorbing neutrons, since such 
absorption removes neutrons from the reaction. The probability of 
capture may be increased by slowing down the neutrons. (When 
a nucleus splits, fast neutrons are formed, but a uranium-235 nu- 
cleus is best able to capture slow neutrons, i.e., so-called thermal 
neutrons.) 

Nuclei of uranium-235 may be split by fast neutrons as well as 
by slow ones, but the probability of being split by the former 15 
less. If the nuclei of a substance may be split only by fast neutrons: 
it becomes impossible to produce a chain reaction: a neutron create 
during fission of a nucleus and slowed down even slightly as the 
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result of one or two chance encounters which do not result in fission 
will be lost to the chain reaction. 3 

At present, the most important nuclear fuels are able to produce 
a chain reaction with slow neutrons. Such fuels include one natural 
isotope (uranium-235) and two artificial elements, viz., plutonium- 
239 (obtained from uranium-238) and uranium-233 (obtained from 
thorium-232). 

The quantity of nuclear fuel in a particular volume must exceed 
a certain minimum in order for a chain reaction to begin. We need 
not be concerned about the first neutron since, thanks to cosmic 
radiation, a small number of neutrons are always present in the 
atmosphere. Moreover, spontaneous fission, i.e., fission due to inter- 
nal forces, also occurs occasionally. (This phenomenon was discov- 
ered by the Soviet scientists Flerov and Petrzhak. They found 
that it is not always necessary for a neutron to be captured in order 
for fission of a uranium-235 nucleus to occur.) Finally, a mixture 
of radium and beryllium may serve as a source of initial neutrons. 


224. Principle of Operation of a Nuclear Reactor 


If a chain reaction begins in a given mass of nuclear fuel and if 
the reaction is uncontrolled, the result will be an explosion since at 
each instant the number of neutrons, and hence the quantity of 
released energy, will be greater than at a preceding instant. The 
quantity of energy released in a small fraction of a second will be 
so great that an explosion results. 

In order to release a constant or controlled amount of energy, one 
must build an installation in which the neutron multiplication 
factor can be controlled. An installation which allows us to do this 
is called a nuclear reactor or pile. In such an installation, it must 
be possible to begin a chain reaction with a multiplication factor 
slightly greater than unity. Then, the concentration of neutrons 
inside the pile, and hence the power of the reactor, will begin to 
increase. After raising the power to the desired level, one must be 
able to set the multiplication factor exactly equal to unity. The 
reaction then becomes self-sustaining since the number of neutrons 
and the energy released per unit time remain constant. 

A reactor should be constructed in such a manner that the neutrons. 
formed upon fission are utilised most effectively. But effective 
utilisation of neutrons does not mean that they must be used exclu- 
sively for the fission of nuclei and the release of energy. Substances 
the nuclei of which absorb neutrons may be introduced into the 
pile. By means of reactions with neutrons, we can obtain large 
quantities of useful artificial radioactive isotopes and, what is 
extremely important, artificial nuclear fuel. Thus, a nuclear reactor 
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may be used not only to produce energy, but to produce artificial 
isotopes as well. 

Neutrons formed upon fission have a velocity of tens of thousands 
of kilometres per second. The velocity of thermal neutrons is of the 
order of 1 km/sec (0.025 ev). Such slow neutrons are most effective 
in producing fission. 

Nuclear reactors operating with a natural or enriched mixture of 
uranium-235 and uranium-238 are of great importance. 

Resonance absorption of neutrons will occur in uranium-238. 
Strong absorption occurs, for example, at an energy of 7 ev. When 
a reactor is loaded with a mixture of isotopes, it is absolutely essen- 
tial to decelerate neutrons below this energy value. 

Thus, the basic elements of a reactor are fuel, a neutron modera- 
tor, a neutron absorber to control the multiplication factor, and 
shielding to protect personnel from neutrons and y-radiation emitted 
during nuclear transformations occurring in the reactor. 

The small RFT uranium-graphite reactor of the U.S.S.R. Acade- 
my of Sciences is a good example of a well-designed reactor. This 
reactor is used for various physical investigations and for the pro- 
duction of artificial isotopes. Its power is 10,000 kw and the flux of 
thermal neutrons at the centre of the reactor is equal to 8 X 40° neu- 
trons/em? sec. The core is in the form of a cylinder the diameter 
and height of which are equal to 1 metre. The cooling water and 
the graphite serve as a neutron moderator. A number of horizontal 
and vertical holes, through which beams of neutrons emerge, pass 
through the reactor shielding. Materials to be irradiated are placed 
in the reactor core through special apertures provided for this 
purpose. 

A considerable number of nuclear reactors have already been put 
into operation or are in the design stage. Such reactors may differ 
from one another with respect to: 1) the type of fuel used (pure 
nuclear fuel, enriched fuel, or natural uranium in the metal form or 
in the form of a chemical compound); 2) the fuel pattern (space 
lattice, rod lattice, or uniform distribution of the fuel in a solution 
or suspension); 3) the moderator (light or heavy water, graphite or 
beryllium); and 4) the type of cooling (water, gas or no cooling). 
Reactors may be designed to have any power ranging from a fraction 
of a kilowatt to hundreds of thousands of kilowatts and may operate 
with slow (thermal) or fast neutrons, depending on the type a” 
quantity of moderator used. 

Reactors are designed to be controlled completely automatically- 
Monitoring is achieved by means of neutron detectors located in the 
reactor walls. Such detectors are able to measure neutron fluxes 
ranging from 1 neutron/cm? sec to 5 X 10% neutron/cm? sec. The 
neutron-sensitive material of a detector may consist of boron-10 oF 
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uranium-235 (fused to the electrodes of an ionisation. chamber) or 
boron fluoride gas (i.e., the ionisation chamber is filled with BF; 
gas). In the former case, the chamber is filled with argon, nitrogen, 
helium or air at a pressure of 2 atmospheres. 

The operation of the ionisation-current amplifiers in the various 
relay mechanisms which transmit the neutron detector signal to the 
actuators controlling the motion of the control rods and safety rods 
must be extremely reliable. 

The position of the control rods varies during reactor operation. 
This is because the quantity of material which absorbs neutrons 
increases as the decay products accumulate. Some of this “poisonous” 
material, e.g., gaseous products, may be removed from the pile 
automatically. 

Nevertheless, gradual withdrawal of the control rods is necessary 
io maintain the neutron density at a constant level. After a period 
of time of the order of 5-20 months the reactor becomes so “poisoned” 
that further operation becomes impossible. The pile must be cleansed 
of decay products and reloaded with new fuel. Fission products 
which are strong absorbers of neutrons include ruthenium-103, 
xenon-131 and 135, neodymium-143, samarium-149 and 451, 
europium-151, 152 and 155 and gadolinium-155. 

Of course, the formation of artificial fuel also causes the multi- 
plication factor to change. 

If in addition to nuclear fuel there is a quantity of uranium-238 
or thorium in a reactor, such a reactor will not only liberate heat 
but will produce artificial nuclear fuel—plutonium from uranium- 
938 and uranium-233 from thorium. 

A reactor in which the quantity of fissionable material increases, 
or at least remains unchanged, is called a breeder. In constructing 
such a reactor, one must use a fissionable material which allows us 
to obtain an average number of neutrons per fission of more than 
two. The condition for fuel reproduction is that the number of atoms 
splitting each second be no less than the number of uranium-238 
or thorium atoms transformed each second into plutonium or ura- 
nium-233, respectively. 

In order to transform the liberated heat into electrical energy, 
one must proceed exactly in the same way as in the case of thermal 
electric power plants operating on coal. Heat extraction from a reac- 
tor is a very difficult engineering problem. However, this problem 
has been solved. The first atomic electric power station has been 
operating in the U.S.S.R. for several years and one has also been 
put into operation in Great Britain. During the next few years, 
a number of large atomic electric power stations will be constructed 


in the Soviet Union. 
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225. Artificial Radioactive Products 


Nuclear reactors constitute the most prolific source of radioactive 
products. Every hour of reactor operation yields a definite quantily 
of products of nuclear fission. This process cannot be controlled so 
as to obtain desired products. “Common” fission products having 
a relatively large (sufficient for practical utilisation) decay period 
include: 


Kr, Sre, Sro, Ts p3, Xel33, Cs137 and Bal’, 


With the further development of nuclear engineering, the quan- 
tity of radioactive fission products (fragments) obtained in reactors 
will reach imposing values. A reactor having a power of 1 kw yields 
a quantity of products which radiates 10° curies in 100 days. At 
present, by no means all of them have practical applications. Of 
most interest are caesium-137 and strontium-90. These have a rela- 
tively long lifetime and are obtained in reactors in relatively large 
quantities. The atomic industry produces them in pieces with an 
initial radioactivity of the order of 1,000-2,000 curies. This is equiva- 
lent to the radioactivity of 1-2 kg of radium. 

Radioactive fission products are used in research work, produc- 
tion (process and quality control), medical treatment, etc. Some 
radioactive fission products are used to sterilise food products, 
antibiotics, vaccines, etc. Such sterilisation requires a very. large 
dose. This is obtained from radioactive preparations yielding 
a thousand curies over a very short period of time—of the order of 
minutes or hours. The radioactivation of irradiated objects is not 
dangerous since fission products radiate B- and y-rays and these do 
not produce appreciable radioactivity in an irradiated substance. 
If packaged goods or goods protected by a natural envelope, e.g., 
eggs or fruit, are subjected to y-ray sterilisation, the bacteria are 
destroyed and new bacteria cannot penetrate through the packing 
or envelope. Large-scale sterilisation by radioactive irradiation may 
replace refrigeration in many cases. 

An important field of application of radioactive isotopes is ra- 
diography. Cobalt-60* and caesium-137 replace X-ray machines in 
the flaw detection of metals: The use of y-rays instead of X-rays 
means that an X-ray tube and its high-voltage equipment are replaced 
by cheap fission products: the basic equipment of a radiographic 
laboratory may be reduced to an ampule of radioactive matter an 
a fluorescent screen or photoplate. Radiography, moreover, has 
broader applications. For example, the inside of a pipe may be 
examined with y-rays, something that is physically impossible to 


* Cobalt-60, which is not a fission product, will be discussed later. 
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do with X-rays because of the size of an X-ray tube. Artificial iso- 
topes may replace radium salts used to paint the dials of clocks and 
other instruments. 

By means of strontium-90, air may be ionised to eliminate static 
electricity, the accumulation” of which is undesirable in various 
industries. It is useful to ionise the air in combustion engine 
chambers since this increases the rate of combustion. 

By means of strong f-radiators, one can construct low-power 
sources of electric current (less than 4 milliwatt). There are several 
ways of creating electrical devices by means of streams of electrons 
from f-radiators. One of the most promising methods is the irradia- 
tion of germanium or silicon with electrons. Layers possessing uni- 
directional electrical conductivity may be manufactured from these 
elements. The distinguishing feature of semiconductors of the ger- 
manium or silicon type is that under the action of external electrons 
a large number of electrons are set free inside these substances. As 
a result of unidirectional displacement of the liberated electrons, 
a potential difference of the order of 0.2-0.3 volt is created across 
a semiconductor pair. The efficiency of such an electrical device can 
be gleaned from the fact that one external electron produces up to 
200,000 internal electrons. Therefore, in spite of the fact that the 
efficiency in the utilisation of radioactive radiation is only 1%, a 
source of 50 millicuries with a half-life of 20 years (strontium-90) 
enables us to produce a low-power generator of electrical energy 
at an insignificant cost. 

A specific isotope may be obtained in two ways: by means of 
a nuclear reactor or by nuclear bombardment in an accelerator. 

When we place a substance in a nuclear reactor, it is subjected to 
the action of neutrons. Since most elements easily react with neu- 
trons, it is possible to obtain radioactive isotopes of almost all the 
chemical elements. As a result, nuclear reactors constitute the main 
source of radioisotopes. 

Irradiation with neutrons is accomplished as follows. The substance 
to be irradiated is placed in an aluminium capsule the length of 
which is about 7 cm, diameter about 2 em, and wall thickness 0.1- 
0.2 mm. If the substance is volatile or may react with aluminium, 
it is placed in a quartz ampule, which is wrapped with quartz fibre 
before being inserted into the aluminium capsule serving to protect 
it mechanically. Ordinary glass cannot be used since it is a strong 
absorber of neutrons. If a large object is to be subjected to irradia- 
tion, a big aluminium box must be constructed for it. Reactors 
designed for the production of radioisotopes are provided with 
tunnels of various cross-sections, through which aluminium boxes 
and capsules may be introduced into the reactor to a great depth. 
After irradiation, the boxes and capsules are placed in lead containers. 
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Radioisotopes are formed in a reactor as the result of neutron 
capture or the dislodgment from the bombarded nuclei of protons, 
and, less frequently, «-particles, by neutrons. At first the number of 
radioactive atoms in an irradiated substance increases uniformly 
with irradiation time. Then, the rate of increase in activity decreases 
until, finally, saturation sets in. For different substances, this 
process proceeds at different rates, depending on the half-life of the 
element. The rate of increase in activity is uniform as long as the 
half-life is 5-10 times greater than the irradiation time. Saturation 
sets in when the irradiation time becomes approximately 5-10 times 
greater than the half-life. 

It should be noted that a nuclear reactor cannot produce the va- 
riety of isotopes produced by a cyclotron. This is quite understand- 
able since in a reactor the conditions of bombardment of atomic 
nuclei are restricted by the nature of the bombarding particle, i.e., 
the neutron, and its range of energies. But there are even objections 
to the use of reactors to obtain large quantities of an isotope such as 
CM, The initial material introduced into a reactor to obtain C! is 
an excellent absorber of neutrons and therefore cannot be used in 
large quantities. Thus, accelerators have a definite role of their 
own to play in the production of radioisotopes. 

The effectiveness of an accelerator is determined by the energy 
and number of nuclei ejected from the accelerator per unit time. 
The number of nuclei can be easily calculated from the average ion 
current, which, in a cyclotron, is equal to 10-44 a. Knowing the 
charge of an ion, one can easily determine that 108 nuclei, or 
2 x 10% gm in the case of deuterium, are obtained each 
second. 2 

In addition to the above facts, one must also know the “effective 
cross-section” of the reaction in order to determine the rate at which 
a particular radioactive substance will be formed under the action 
of a beam of nuclei. This quantity is designated by o and its value 
is given in cm?. It has the following significance. If S is the area of 
a bombarded sample, L is the probability that the nuclear projectile 
will hit the target nucleus and produce the given reaction. The order 
of magnitude of o is usually about 10-** cm?. In the case of certain 

nuclei, the absorption of neutrons increases sharply at definite 
neutron velocities. For example, cadmium is a strong absorber of 
slow neutrons with an energy of 0.18 ev. The effective cross-section 
of this reaction is of the order of 7,000 x 10-24 em?. 

The cross-section of a given reaction depends to a great extent 
on the energy of the bombarding particle. If a sharp peak occurs 
in a curve of cross-section plotted against incident particle energy, 
the peak value is referred to as the resonance cross-section. 
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It is evident from the values given above that the production of 
significant quantities of isotopes often requires many weeks of 
accelerator operation. 

The radioactive elements obtained from reactors and accelerators 
are widely employed as tracers (whence the designation “tagged 
atoms”) in practically all branches of science and engineering. The 
mass of material required for all such purposes is not very large, 
but since elaborate industrial facilities serve to produce only small 
amounts of radioactive isotopes this industry is rapidly expanding 
to meet the growing demands. 

The presence of a radioactive isotope in a substance is generally 
determined by means of a counter. In this manner, an activity of 
less than 107! millicurie can be measured. Thus, a quantity of 
radioactive phosphorus having a weight of less than 10-1 gm can 
be detected. 

Widely used isotopes include the following: Cobalt-60, which is 
B--radioactive with a half-life of 5.2 years; since it is a strong B-ra- 
diator it is widely used in gamma-raying and irradiation. Carbon-14, 
which has a half-life of 6,360 years; widely used in biochemistry, 
geochemistry, and in the study of the kinetics of chemical reactions. 
Phosphorus-32 and sulphur-35, which are f~-radioactive with a half- 
life of 14.3 and 87 days, respectively; one of their most important 
uses is in agriculture in the study of fertiliser assimilation by plants. 
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Theoretical calculations indicate that atomic nuclei of almost 
all elements can serve, in principle, as sources of energy. It turns 
out that any nucleus heavier than the nucleus of a silver atom pos- 
sesses more energy than the components into which it may be divid- 
ed. All heavy nuclei release energy when they split up. The heavier 
the nucleus, the greater the magnitude of this energy. That is why 
uranium is the “best” nuclear fuel. 

However, light nuclei can also serve as sources of energy. Theoret- 
ical calculations indicate that a nucleus obtained by the fusion 
of two light nuclei will possess less energy than the original parti- 
cles. Therefore, energy is released upon fusion of light nuclei. Here, 
too, the further away from the mid-point an element stands in the 
Mendeleyev periodic table, the greater the amount of energy 
released in such a reaction. The greatest amount of energy is obtained 
from the fusion of nuclei of hydrogen atoms. 

What conditions are required to achieve fusion between light par- 
Nuclear bombardment cannot yield the desired result since 
article is decelerated rapidly in a substance. The only 
aise the temperature. It is easy to calcu- 
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late the temperatures that atomic nuclei require to enable them to 

approach each other closely, i.e., to overcome electric repulsion. 
Calculations indicate that the temperature at the centre of the 

Sun is 20 million degrees. Knowing that the energy associated with 


$ 1 : iui 
each degree of freedom is equal to- 4T, one can determine the average 


kinetic energy of a particle at this fantastically high temperature. 
This energy is equal to merely 3,000 ev. Now, let us calculate from 


b S a A q? 
the formula for the potential energy of electrical interaction, U = a 


how closely two protons will approach each other. It turns out that 
the distance will be equal to 5 X 10- em. As we know, the radius 
of a nucleus is considerably less than this value. Nevertheless, ther- 
monuclear reactions, i.e., reactions occurring at a high temperature, 
are possible in the Sun. Calculations which take into account the 
tunnel effect, and the fact that in every gas, including a gas of 
nuclear particles, there are particles the velocities of which consider- 
ably exceed the average, indicate that in a year one atom in a million 
takes part in nuclear fusion. This small fraction is sufficient to 
account for solar activity. i 
Under terrestrial conditions, such high temperatures have been 
repeatedly created in the United States and the Soviet Union during 
hydrogen bomb tests. Temperatures of scores of millions of degrees 
are created during uranium bomb explosions. If a substance whose 
nuclei are capable of combining and releasing energy is located 
within the zone of this explosion, a thermonuclear reaction, the 
energy of which is many times greater than that of a uranium bomb, 


will occur. The uranium bomb in this case serves to trigger the 
thermonuclear reaction. 


Here are examples of the most easily re 
release large quantities of energy: 


D? + Ht= He; Li?7+ Ht = 2He?; 
H?+Ht—Het, CH. —i= Nu, ` 


Thermonuclear reactions can occur only at temperatures which 
give nuclei a thermal velocity sufficient to overcome, with appre- 
ciable probability, the Coulomb potential barrier. 

Of greatest interest are the reactions in deuterium and in mixtures 
of deuterium and tritium since these require the least amount of 
energy. A temperature of 200,000° is required to obtain one neutron 
per second in a gram of deuterium (in accordance with the reaction 
1D? + ,D? = He + ont). In a highly rarefied gas, an even higher 
temperature (of the order of 500,000°) is required for the same pur- 
pose. At such a temperature, deuterium (like other substances) will, 
form a plasma of nuclei and electrons. To transform deuterium to 


alised reactions which 
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this state requires very little energy—of the order of several kilo- 
watt-hours. However, the difficulty does not lie in transmitting 
this energy to deuterium, but rather in providing thermal confine- 
ment, i.e., the deuterons must preserve the corresponding kinetic 
energy for a long period of time. 

A possible solution to this problem was proposed by Sakharoy 
and Tamm. They suggested that thermal confinement be achieved 
by utilising the electrodynamic forces which constrict parallel cur- 
rents. When a voltage is applied across a plasma, it is constricted, 
i.e., a plasma column is formed (pinch effect). This constriction 
reaches its culmination when the temperature of the plasma increases 
to a million degrees. At such instants, deuterium nuclei initiate 
a thermonuclear reaction. This theory has been confirmed by Soviet 
scientists and the method may prove to be the most practical means 
of utilising the energy of thermonuclear reactions. Experiments 
have been conducted with electric discharges in tubes with discharge 
gaps of up to 2 metres, and currents of 100,000 to 2,000,000 amperes 


have been investigated. 
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227. Polycrystalline Substances and Monocrystals 


As a rule, crystals begin to grow around a very large number of 
centres in a melt or solution. If special measures are not adopted, 
a polycrystalline substance rather than a monocrystal will form as 
the result of crystallisation. Under a microscope such a substance 
seems to consist of individual grains (see Fig. 247). Each grain is 
a crystal which has an irregular 
haphazard form due to the fact thal 
TG its normal growth has been impeded 
Lye A by neighbouring crystals. Most bod- 
4 les commonly encountered, partic- 
| ularly metals and rocks, are poly- 
crystalline substances. 

The boundary between grains is 
revealed by etching with an ap- 
propriate solvent. Thisis due to the 
fact that most of the impurities of 
a substance accumulate at the grain 
boundaries. The interlayer between 
crystals differs from the “body” of 
the grain not only in that it contains 
foreign atoms, Dut in that its atoms 
3x7 have a distur ed (transitional) 
4 arrangement. The basic structure of 

the boundary between grains is 
clearly visible under a microscope as peculiar, smooth “paths”. The 
usual size of grains in metals and rocks is 10-4-1410- cm. 

A single crystal (monocrystal) of any crystalline substance may 
be found in nature or artificially grown. A monocrystal is distin- 
guished by its regular shape, i.e., plane faces, straight edges, and 
symmetry, in other words, the proportionality of its component 
parts. This regular shape reflects a crystal’s internal properties, 
which enable us experimentally to distingush a crystal from a bit of 
material given such a shape artificially. It is also not difficult to 
recognise a crystal when its characteristic features are hidden. Thus, 
a sphere may be fashioned from a large crystal of rock salt but, 
when it is placed in water, surface material is dissolved at a non- 
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uniform rate and, as this process goes on, its accidental shape tends 
to be transformed into the polyhedral shape which is natural for 
this substance. A monocrystal can be easily distinguished from 
a polycrystal by means of X-ray analysis. 

A naturally formed crystal has the shape of a polyhedron. As in 
the case of every polyhedron, a crystal has a certain number of faces 


(p), edges (r) and corners (e), 


which are related to one another 

as follows: p +e =r + 2. For S 

example, a cube has 6 faces, 8 

corners and 12 edges. 

Crystal faces are arranged in e| [mn 

bands or zones. A system of faces 

the intersections of which are aN 
DSS 


parallel edges is called a zone, 
and the direction of these edges 
is called the axis of the zone. 
Crystals of one and the same 
substance may differ considerably 
with respect to shape, but it has 
been long known that a given sub- 
stance has characteristic angles 
between faces and edges. (Depend- 
ing on chance, one component 
of a crystal may grow more than 
another; as a result, apparently, 
the proportionality between com- 


ponents may be upset.) This im- 
portant rule, which may be called the law of constant angles, is 


illustrated in Fig. 248. In the figure, we see four different crystals 
of silicon dioxide. It is seen that the number of faces and their relative 
dimensions differ from specimen to specimen, but the angles between 
corresponding faces and edges remain unchanged. 


Fig. 248 
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The distribution of matter in a crystal may be represented by 
a three-dimensional periodic function. This is the fundamental rule 
lying at the basis of crystal investigations. 

Fig. 249 shows a wall-paper pattern. A certain element of this 
pattern is repeated in two directions. Consider any point A in the 
figure. A system of lines may be drawn through the selected points 
(nodes) as shown. A pattern element the repetition of which yields 
the full pattern is enclosed within a cell of the resulting grid. Evi- 
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dently, the entire pattern can be obtained from a single cell by 
means of parallel translations of the cell vectors a and b. 

A crystal constitutes a space lattice, not a plane lattice. An ele- 
ment of a crystal is a parallelepiped based on three translational 
vectors a, b, c, which may be selected, generally speaking, in an 
infinite number of ways. Such a 
parallelepiped will be called an 
elementary or unit cell, the vectors 
a, b, c the basic translational 
vectors, or simply vectors, and 
their lengths a, b, € the basic rep- 
etition periods or lattice spac- 
ings. The lattice is described in 
a system of coordinates the axes 
of which coincide with the direc- 
tions of the basic vectors. Differ- 
ent. ways of selecting basic vec- 
tors, i.e., an elementary cell, are 
illustrated for a two-dimensional 
case in Fig. 250. An elementary 
cell in the general case is an ob- 
lique-angled parallelepiped with 
edges a, b, cand angles « = b,c, 


B=c,a, y =a, b. The six quan- 
tities which uniquely describe 
an elementary cell are called its 
parameters. Since the entire lat- 
tice is determined when an elemen- 
tary cell is given, the above 
Fig. 249 2 quantities are sometimes called 
the parameters of the lattice. 
A cell in the form of an oblique-angled parallelepiped is said to 
be triclinic and if @ = y = 90°, monoclinic. A cell in the form of 
a right-angular parallelepiped is said to be rhombic and if in addi- 
tion a = b, tetrahedral. If a = b + c, œ =f = 90°, and y = 120°, 
a cell is said to be hexagonal. The simplest cells have the form 
of a cube. 
If one of the lattice points is selecte 
nate system, the radius vector of 
by the formula 


d as the origin of the coordi- 
any other lattice point is given 


Rmnp = ma +nb + pe, 


where m, n, p 


are whole numbers representing the coordinates of 
these nodes. 


The indicated numbers are called the indexes of the 
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nodes. The set of three indexes describing lattice points is designated 
by the nodal symbol [[mnp]]. £ 

There are an infinite number of nodal lines and nodal planes. 
Nodal lines and nodal planes are represented in a lattice by infinite 
families of parallels. The transition from one line to another of the 
same family, or from one plane to another, occurs by translation 
along a vector joining two nodes of these lines or planes. Each family 
of nodal lines is described by the lattice spacing along a nodal line 
and the direction, i.e., incline, to the selected coordinate axes. 


o O GO 9i TONO TONTO 


Fig. 250 


To describe a family, we select the line passing through the origin 
of the coordinate system. A nodal line is described uniquely by the 
indexes uw, v, w, of the first lattice point lying on this line. The indexes 
of this lattice point are called the indexes of the line and are desig- 
nated by [uvw]. If an index is negative, a minus sign is placed above 
the numeral. The symbol [100] represents the a-axis of the lattice, 
[010] the b-axis, and [001] the c-axis. The lines [014] and [011] 
represent plane diagonals in the face be. Of course, [044] and [041] 
are one and the same line. Distinguishing between these two designa- 
tions has significance only if we wish to emphasise the polarity of 
the direction. The spatial diagonals of a cell have the symbols [111], 
[1441], [111] and [114]. There are four of them, corresponding to the 
existence of eight quadrants; the other four symbols represent the 
same lines, but with reverse polarity. Thus, [114] is anti-parallel 
to [411], étc. 

A space lattice can be constructed as follows. First, an infinite 
plane-lattice (nodal plane) is formed by means of two translational 
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vectors; then, a space lattice is formed by means of a third transla- 
tional vector which does not lie in this plane. A crystal lattice can 
be represented by families of nodal planes in an infinite number of 
ways. Every family of nodal planes consists of parallel planes sepa- 
rated from one another by equal distances. For a given lattice, 
specification of the interplanar distance and the orientation of one 
of the planes relative to the selected coordinate axes completely 
describes a family-of nodal planes. It is also sufficient to give the 
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Fig. 254 


orientation relative to the selected axes of the plane closest to the 
origin. The distance of this plane from the origin will be equal to 
the interplanar distance of the given family. 


š b 
Let this plane intersect the lattice axes at the coordinates + E 


ce . . . . . 
and Tie, fractions of the basic lattice spacings. The numbers h, 


k and l, which describe the orientation of the plane, will be called 
the indexes of the plane. It is easily seen that k, k and l are whole 
numbers. One way of showing this is as follows. Consider a plane 
passing through an initial lattice point and another plane, of the 
same family, displaced by an amount a. This is shown in Fig. 2 1. 
Other planes will pass through these nodal planes, but they must Þe 
separated from one another by equal distances. Therefore, the repe- 
tition periods along the selected axes will be divided by the nodal 
planes into a number of equal parts. 
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The plane closest to the origin and intersecting the axes at 4. à + 
aor 


and + of the lattice spacings is described by the set of three indexes 


h. k and l. Its symbol is designated by enclosing these indexes be- 
tween round brackets: (kkl). For example, the plane (236) intersects 
the axes at the coordinates = t L and È . Any plane which intersects 
the axes at coordinates which are a multiple of these values is a 
member of this family. Thus, in the case under consideration, the 
successive planes beyond the 
one closest to the origin will 
intersect the axes at the fol- 
lowing coordinates: a, 2b, 33 
2. a, b, > etc. 

If a plane intersects the axes 
at negative coordinates, this 
is indicated by a minus sign 
above the corresponding index. 
It is evident that the planes 
(hkl) and (hkl) belong to the 
same family. Therefore, all 
the signs of the indexes of a 
plane may be reversed. 

Ifa plane is parallel to a co- 
ordinate axis, the correspond- 
ing index is equal to zero. Thus, 
(110) is a plane that is parallel 
to the c-axis, (001) is the lattice plane ab (see Fig. 252), (010) is the plane 
ac, and (100) is the plane be. Planes passing through one of the axes 
and one of the diagonals have indexes consisting of two ones and 
one zero. For example, the plane (101) is a plane which is parallel 
to the b-axis and passes through the diagonal extending from the 
terminal of vector --a@ to that of vector +e (not the diagonal passing 


through the origin). The plane (101), which passes through the 
terminals of the vectors —@ and —é, belongs to the same family. 


The plane (101) and its “reverse side” (101) are also parallel to the 
b-axis and pass through the diagonals ac which do not begin from 
the zero lattice point, but extend from the terminal of vector +@ 
to that of vector —¢ and the terminal of vector —a@ to that of vector 
+e, respectively. 

A symbol consisting of three units refers to planes passing through 
three diagonals. These planes pass through the terminals of all 
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three lattice vectors. Thus, the plane (111) passes through the ter- 
minals of the vectors —a, —b and +e. 

The indexes of a family of nodal planes are at the same time Lhe 
indexes of crystal faces. Two parallel faces have the indexes (hil) 


and (hkl). 
229. Cell Selection and Crystal Symmetry 


The vectors a, b and ¢ of a lattice may be selected in a variety of 
ways. If there are no lattice points within an elementary cell, we 
call it a primitive cell. 

Various ways of selecting a primitive elementary cell are shown 
in Fig. 253. In view of the periodicity of a space lattice, the magni- 
tude of the volume associated with each lattice point is a constant. 


Fig. 253 


Its value is equal to the volume of a primitive elementary cell, re- 
gardless of the manner in which such a cell is selected. Since each of 
the eight lattice points at the corners of such a cell is “shared” by 


eight cells, Lof the lattice points bélongs to the given cell. Thus, 


on the average, there is one lattice point per cell. 

In a number of cases, it is expedient to select an elementary cell 
so that its volume is greater than that of a primitive cell. Thus, 
in order to take maximum advantage of the symmetry of a crystal, 
we often select an elementary cell with an additional lattice point 
at face centres or at the centre of the cell. Three cases are encoun- 
tered frequently: 

1) Body-centred cell. An additional lattice point is located at the 
intersection of the spatial diagonals of the cell. In this case, there 
are two lattice points per cell: [[000]] and [zr] . The lattice 
point at the centre of a cell belongs entirely to the given cell. The 


229. Cell Selection and Crystal Symmetry 591 


eight lattice points at the corners are shared jointly by eight cells, 


F 1 : Š 
ies, = of each of these lattice points belongs to the given cell. 


2) Face-centred cell. An additional lattice point is located at the 
centres of a pair of faces, e.g., ab. In this case, too, there are two 


lattice points per cell: [[000]] and [z ]] ? 


3) All-sided face-centred cell. An additional lattice point is locat- 
ed at all face centres. In this case, there are four lattice points per 


ce: ooon, [[0 44]; [+0 +] an¢ [i e] 


The following designations are commonly used: P—primitive 
cell; A, B and C—face-centred cells with a lattice point in faces 
bc, ac and ab, respectively; F—all-sided face-centred cell, and J— 
body-centred cell. 

As emphasised earlier, a lattice point is an arbitrary point, but 
for convenience is selected in a specific manner. The succeeding 
lattice point is separated from the selected one by a distance equal 
to the lattice spacing. Thus, there is one lattice point per primitive 


cell. 

All the atoms of a primitive cell may be replaced by lattice points. 
Usually, a point of intersection of symmetry elements is taken as 
a lattice point. 

A primitive cell may consist of many atoms. When there are 
many atoms in a cell, the se is described by the coordinates 

ary cell. 


of the atoms in an element ERE” 
The model of a crystal as a space lattice is in full accord with 


experimental data. Crystal edges and faces correspond to nodal 
straight lines and planes. The angles between crystal faces and 
edges are the same for all crystalline objects of a given chemical 


compound. 
The symmetrical features of crystal structure may also be deter- 


mined from the space lattice model. i 
l e different symmetry. If a 


Crystals of different substances hav 1 r 
crystal is well formed, its symmetry 1S self-evident. It is clear that 
planes and axes of symmetry may be passed through a crystal in 


a specifi anner. ; Ae 
ET symmetry of a crystal can be explained by its inter- 
nal structure, i.e., the symmetry of the space lattice. 
In addition to axes of symmetry, symmetry elements encountered 
in crystals include mirror planes and inversion or symmetry centres. 
Fig. 254 illustrates the operations which may be performed by 


means of these symmetry elements. 
It has been bag known that axes of symmetry of fifth order and 
axes of symmetry of higher order than the sixth do not occur in 


— 
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crystals.* Basing himself on this fact, A. V. Gadolin proved that 
there can be only 32 groups of symmetry among crystals. 

Before the development of the space lattice theory, it was unclear 
why axes of fifth, seventh, etc., orders were not encountered in 
crystals. This and other features 
of crystal symmetry can be ex- 
plained by the space lattice the- 
ory. 

Let us consider rotation of a 
lattice plane. Rotations which 
are not possible for a plane will 
certainly not be possible for the 
entire lattice. 

Fig. 254 Assume that an axis of n-th or- 

‘der passes through the lattice 

point B and that the identical one closest to it passes through the lattice 
point A (see Fig. 255). Rotation about axis B transfers lattice point 
A to A’ and rotation about axis A transfers lattice point B to B’. 
It is seen from the diagram that B’A’ = AB (1 + 2 cosa). But 


EES ae 


A 
Fig. 255 


the distance A’B’ must be a multiple of the lattice spacing AB, 
for A'B’ is parallel to AB. Therefore, 2 cosa must be equal to 
a whole number. It follows that cos a can assume only the values 0, 


SA Ż and +1, and œ the values 60°, 90°, 120°, 180° and 360°. This 


also follows from the definition of a closed symmetric operation: 
the rotation angles are equal to 360° divided by a whole number. 
Thus, it is possible to have rotational axes of symmetry of sixth, 
fourth, third, second and first orders in a crystal. 


* Tf a body is turned about a certain axis by an angle 2T so that the 
>, n 


figure obtained coincides with the original figure, such an axis is called an 
axis of symmetry of n-th order. For example, a 3-faced prism based on an isos- 
celes triangle has an axis of symmetry of third order which passes through the 
centre of the triangle and is parallel to the edges of the prism. 
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One can prove easily that an axis of sym i i 
which is normal to a nodal plane. oae a PE 

The symmetry of a crystal is determined by the symmetry of the ; 
space lattice. But it should be realised that the symmetry of a lat- 
tice is considerably richer. Only 32 groups of crystal symmetry exist 
but, as was first shown by E. S. Fyodorov, the founder of structural 
crystallography, a space lattice has 230 types of symmetry (Fyodo- 
rov groups). 

The richer symmetry of a lattice is due to the fact that in addition 
to closed symmetric operations it includes a symmetry element 


Fig. 256 


which is not possible for a body, i.e., translation. A symmetric 
operation consists in the displacement of a body to a position which 
is indistinguishable from its original position. Therefore, in the case 
of an infinite lattice, a displacement along one or another nodal 
line is a symmetric operation. 
Translation introduces the following new symmetry elements: 
4) a combination of rotation and translation—screw axes; and 2) a 
- combination of reflection and translation—slip planes. A screw axis 
of fourth order is shown in Fig. 256 (right diagram). Each of points 
2, 3, 4 and 5 is obtained from the preceding one by a 90° turn and 
displacement along the axis by + of a period (t). In Fig. 256 (left 
diagram) we see triangles reflected in a plane Q and slipped along 


AA’ by = of a period. 
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230. The Packing of Particles in a Crystal 


Figures having a definite volume and shape can be stacked or 
packed. It is by no means clear to what extent a formation of atoms 
will conform to this picture. Here, the answer to the following ques- 
tion is of prime importance: If we attribute a definite shape to an 
atom or group of atoms, will this even roughly correspond to a mini- 
mum on the potential curve of particle interaction in a given direc- 
tion? Moreover, to what extent will the volume attributed to the 
atom or group of atoms encompass all the electrons, including the 


Diketopiperazine 
Fig. 257 


valence electrons, belonging to this atom or group of atoms? If it 
turns out that describing the limits of an atom or molecule by a 
definite contour has physical meaning, we will have ascertained at 
the same time how this shape is manifested when a crystal is formed 
from particles. 

The nature of interactions between atoms in a crystal is infinitely 
varied. However, several limiting cases can be singled out: pure 
ionic bonds, homopolar bonds, metallic bonds and molecular bonds. 
As examples, let us consider the structures of rock salt, zinc sulphide, 
iron and the organic substance diketopiperazine (see Fig. 257). 
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In the first three cases, the absence of a tight group of atoms is 
characteristic.-A molecule cannot be distinguished in a crystal of 
rock salt. Each atom of sodium has six perfectly equal chlorine 
atoms as neighbours. Nor can a molecule be distinguished in the 
other two compounds, which are examples of homopolar and metallic 
bonds. ¢ 

In ionic crystals, the particles consist of positive and negative 
ions, which attract one another in accordance with the laws of elec- 
trostatics. Everything said about ionic bonds in molecules applies 
to crystals. The nature of ionic structures is communicated reason- 
ably well if the ions are represented by spheres having definite 
ionic radii. The values of ionic radii in ionic molecules differ little 
from those in crystals. 

In crystals with a homopolar bond, every two atoms are linked 
by a pair of electrons. In this manner each zinc atom is linked with 
its surrounding sulphur atoms, and each sulphur atom with its sur- 
rounding zinc atoms. If a sign of a homopolar bond were considered 
to be an indication of a molecule, it could be said that an entire 
crystal constituted a single molecule. It is physically meaningless 
to consider homopolar crystals as constructed of contiguous spheres. 
The dimensions of free atoms of sulphur and zinc are considerably 
greater than the distance between them in zinc sulphide. A homo- 
polar bond brings the atoms into close contact and makes regions 
in which electrons of these atoms are located common regions. If 
the structure of zinc sulphide consisted of contiguous spheres, 
a large portion of the electron cloud would be located outside the 
spheres, i.e., only 25 per cent of the volume would be occupied by 
spheres. 

Metallic bonds will be discussed in Chapter XXXVII. Here, a few 
words on this subject will suffice. In metals and alloys, outer elec- 
trons are common, forming an electron “gas”. The lattice of a metal- 
lic compound consists of atomic residues (positive ions) “cemented” 
by electrons. Here, too, it is physically meaningless to represent the 
structure as contiguous spheres, in spite of the fact that formally 
the structures of certain metals can be represented as closely packed 
spheres. J : ¢ 

In crystals of the same type as diketopiperazine the molecules are 
distinct. They can be easily recognised since the intermolecular 
distances are considerably greater than the intramolecular distances. 
By studying the arrangement of molecules in crystals, crystallo- 
graphers have been able to pick out intermolecular radii of portions 
of spherical surfaces describing the limits of a molecule. The model 
of a molecule formed of microscopic spheres of intermolecular radius 
is based on an analysis of crystal structures. Such a geometric repre- 
sentation of crystals formed of molecules is quite justified since 

38% 


596 Atomic Structure of Bodies 


most of a molecule’s electron cloud is contained within the contour 
of the molecule. 

It must be concluded that the representation of the component 
particles of a crystal as geometric figures is meaningful in two cases: 
in ionic and in molecular crystals. On the other hand, such a rep- 
resentation is meaningless in the case of homopolar crystals and 
metallic compounds. 

Now, the following question arises: How is the shape of ions and 
molecules manifested in the formation of a crystal? The answer is 
that it is manifested in the compact packing of particles. Experi- 
ments indicate that molecules are always packed in such a way that 
a “projection” of one molecule fits into a “depression” of another. 
There is a clear tendency for the molecules to become so oriented 
with respect to one another that the volume of an elementary cell 
is as small as possible. The situation is similar in the case of ionic 
crystals. The stacking of spheres occurs in such a manner that the 
large spheres fit closely together, while the small spheres (ions) fit 
into the empty spaces of the basic structure. 

In representing ions by spheres, and molecules by spatial figures, 
we find that the “empty” space is equal to 25-35 per cent of the total. 

Close packing in molecular and ionic crystals provides basic 
proof that shape and volume are meaningful attributes of atoms 
and molecules. 


231. Molecular Crystals. 


: The assertion that molecules are bound by intermolecular forces 
ina molecular crystal is, of course, pure tautology. What then can 
be said about this concept and about the nature of intermolecular 
forces? 

The forces of attraction acting between molecules are interaction 
forces of electrical origin. A molecule possesses electrical properties 
even though it is electrically neutral, i.e., its total charge is equal 
‘to zero. 

In order to take these properties into account, all we need do is 
consider a molecule to be a dipole. As we know (see p. 252), dipoles 
attract each other. If-all the molecules of a substance are polar, 
i.e., have a constant dipole moment, inherent dipole interaction 
exists between them. If a portion of the molecules are polar (the 
substance consists of several kinds of molecules), such molecules 
may induce dipole moments in neighbouring molecules. This is 
called induced interaction. 

Finally, it may be said that dipole interaction always exists be- 
tween momentary dipoles. (Every molecule is a momentary dipole 
at. every instant of time, since the centre of gravity of electrons 
never coincides with that of nuclei.) 
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It is not surprising that the potential energy of interaction is 
represented by a very complex function. The calculation of the 
dependence of the potential energy of interaction on the mutual 
orientation and spacing of molecules in a crystal is hardly possible. 
The potential of the attractive forces can be calculated only when 
the distances between molecules are large. When these distances 
are much greater than the dimensions of the molecule, the forces 
of interaction are inversely proportional to 7’. This is not the case 
in crystals and liquids. 

The enumerated forces of attraction are counterbalanced by the 
forces of repulsion between the electron clouds of neighbouring 
molecules. The Pauli exclusion principle does not permit the inter- 
penetration of clouds, so the forces of attraction bring the molecules 
into “contiguity”. 

Although the binding energies of different molecular crystals may 
differ considerably, intermolecular distances in no way depend on 
molecular polarity, binding energy, etc. Evidently, the separation 
between molecules is determined mainly by the well-defined bound- 
aries enclosing the electron cloud of a molecule. i 

The shape of a molecule determines the nature of the molecular 
packing as well as the intermolecular distances. Molecular polarity 
and other characteristics of the forces of attraction not only do not 
affect the intermolecular distances, but also do not disturb the 
tendency to compact packing of molecules. Thus, in practically all 
cases, minimum energy is achieved by compact packing of mole- 
cules. Since molecular crystals strictly obey the principle of compact 
packing, only an exceedingly small variety of structural types and 
symmetry are encountered among such crystals. £ 

It is convenient to view the packing of molecules in crystals as 
a tight packing of compact layers. Two types of such layers are 
encountered. These are illustrated in Fig. 258. The one with right- 
angled cells has greater symmetry. In this case (more than 90 per 
cent of organic crystals are formed of such layers), the molecules 
are stacked in the characteristic zig-zag manner shown in the figure. 
The rows of molecules forming a layer are connected by a screw 
axis of the second order (2,). This means that one row of molecules 
can be transformed into an adjacent one by a 480° turn and a displace- 
ment of half a period along the axis. 

In compact layers, each molecule has six close neighbours. When 
the layers are stacked, a molecule usually obtains six additional 
close neighbours—3 above and 3 below. Thus, the total number 
of such neighbours becomes equal to 42. 

Crystals having a high order of symmetry are rarely encountered 
in the world of molecular crystals. It is not possible to pack com- 
pactly unsymmetrical molecules in symmetrical crystals. 


Tab è 
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If a molecule possesses symmetry, this does not mean that the 
crystal also has such symmetry. A molecule of naphthalene has a 
high order of symmetry; three mutually perpendicular planes of 
mirror symmetry may be drawn through it (see Fig. 259). If these 


Fig. 258 


symmetry elements were preserved in a formation of molecules, the 
packing would be insufficiently compact. Therefore, the symmetry 
elements of a molecule which prevent greater compactness are “lost” 
in the formation of a crystal. 
The preservation of a centre 


/ 
- 8 = ran of inversion is possible without 
; sacrificing compactness of mo- 
lecular packing. A crystal 
formed of molecules possessing 
this symmetry element usually 


does not preserve other sym- 
metry elements of the mole- 
cules, but does preserve a 
centre of inversion. 

In other cases as well, the outcome of the tendency to symmetry 
as opposed to the tendency to compactness can be reliably pre- 
dicted. 

An example of a packing arrangement typical of molecular crys- 
tals is provided by diketopiperazine (see Fig. 257). The molecules 
have a high order of symmetry, but the crystal preserves only a cent- 
re of inversion. A molecule, of course, does not cease to be highly 
symmetrical simply because a crystal of such molecules does not 
possess its symmetry elements. 


Fig. 259 
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232. Compact Packing of Spheres 


A very important class of ionic crystals may be represented by 
a compact packing of spheres. 

Most anions are larger than cations. In such cases, crystals con- 
stitute a compact packing of anion spheres between which cations 
are located. This is how silicates, one of the largest groups of natu- 
ral inorganic substances, are formed. In silicates, the cations are 
located in the empty spaces of a 
compact structure of oxygen anions. 

Let us examine the laws of com- 
pact packing of spheres, i.e., the 
fundamental structures of a great 
number of crystals. The. only pos- 
sible arrangement ofa compact layer 
of spheres is shown in Fig. 260. 
Bach sphere has six neighbours. To 
form a compact packing arrange- 
ment, one must place the spheres 
of a second layer in the spaces of 
the layer below it. It is not possible 
to fill all the spaces with spheres 
of the same size: every second 


space in the figure is filled (crosses 
denote the spaces of the first layer which are filled by spheres of the 


second, and dots denote the spaces which remain vacant). 

There is also only one possible arrangement for compact packing 
of two layers of spheres. For three layers, however, the situation is 
different. In order to achieve compact packing in this case, one 
must place the spheres of the third layer in the spaces of the second, 
but this can be done in two ways: the centres of the spheres of the 
third layer may be placed either above the centres of the spheres 
of the first layer or above the spaces denoted by dots. Both three- 
layer structures are packed equally compactly; however, they differ 
significantly from one another. When a fourth layer is added to the 
structure, the number of possible packing arrangements becomes 
even greater: from the two three-layer structures, one can make four 
four-layer structures. In the case of a five-layer structure, there are 
five possible arrangements, etc. It is evident that the number of dif- 
ferent arrangements of spheres packed equally compactly increases 
very rapidly with increasing number of layers. 

Now, let us compare a crystal lattice with such an arrangement 
of spheres. A crystal may be represented as a structure of spherical 
‘atoms in which the arrangement of layers is repeated exactly after 
a certain number of layers. If such a sequence begins with the four- 


600 Atomic Structure of Bodies 


teenth layer, for example, this means that a cell is thirteen layers 
high. In such a case, the fourteenth layer is above the first, the 
` fifteenth above the second, the sixteenth above the third, etc. 
The simplest packing arrangement consists of two layers: the 
third layer lies above the first, the fourth layer above the second, 
etc. (see Fig. 264, right). This is a hexagonal compact packing 


Fig. 261 


arrangement. A cell of such a crystal is shown in the lower right- 
hand corner of the figure. The dots and crosses denote the locations 
of the centres of the spheres. 

Three-layer crystals, in which the fourth layer is a repetition of 
the first, the fifth of the second, etc., are very common (see Fig. 261, 
left). In the lower left-hand corner of the figure, where only the 
centres of atoms are indicated, we see that a cubic elementary cell, 
centred in all faces, may be selected. Here, the compact layers are 
arranged perpendicular to the spatial diagonal of a cube. Such a 
structure is called a cubic compact-packing arrangement. 

Empty spaces remain in a packing of spheres of equal size. It can 
be easily calculated that the volume of such spaces is equal to 
about '/, of the overall volume. There are two kinds of such empty 
spaces: one is surrounded by four spheres the centres of which are 


233. Examples of Crystal Structures 601 


located at the vertexes of a regular tetrahedron (see Fig. 262a); the 
other is surrounded by six spheres—the centres of these spheres 
form a regular octahedron (see Fig. 262b). The first kind is smaller 
in size, but there are twice as many of them as the second. 

Tt can be shown that in any compact arrangement of equal spheres 
there are two small empty spaces and one large one per sphere. 
Small spheres can fit into these spaces but if they are somewhat too 
large, they cause the large neighbouring spheres to move apart, 
loosening the compact packing 
arrangement. 

Since different packing arran- 
gements are possible with equal 
numbers of spheres, and small 
spheres may fill the empty spaces 
in different ways, ionic crystals 
have a great variety of struc- 
tures. 

In crystals of common salt, a 
compact three-layer structure is : 
formed by large chlorine ions Fig. 262 
(light-spheres in Fig. 257), and so- 
dium ions (dark spheres) fill all the large spaces; hence, every sodium 
atom is surrounded by six chlorine ions. In iron disulphide (pyrite), 
a compact two-layer structure is formed by large sulphur ions; iron 
ions fill all the large spaces. In a crystal of lithium oxide, the chemi- 
cal formula of which shows that there are two lithium atoms for 
every oxygen atom, the compact structure is formed by large oxygen 
ions. Since lithium ions fill all the small spaces, each lithium ion 
has four neighbours (oxygen ions). In a crystal of cadmium chloride, 
the chemical formula of which shows that there are two chlorine 
atoms for every cadmium atom, the compact structure is formed 
by large chlorine ions; cadmium ions fill large spaces but not all, 
i.e., they fill the large spaces of every third layer of chlorine ions. 
We have presented, of course, only the simplest patterns” of the 
filling of empty spaces in compact packing arrangements. 
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i ies formed of mole- 
The largest group of crystals consists of bodies 
cules. Tonie E also constitute a fairly large group. As 


indi eady, the representation of a crystal in these cases as 
E ER particles is entirely justified. However, it is neces- 
sary to examine those structures in which the direction of the bonds 
between atoms, the deviation of the electron cloud from spherical 
symmetry, etc., are the cause of structural arrangements which 
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cannot be represented so simply. Such exceptions include structures 
of atoms bound by common electrons. 

Most metals have structures consisting of body-centred cubic 
cells. In such crystals, each atom has eight neighbours rather than 
twelve, as is the case in a compact packing of spheres. This is the 
case, for example, for atoms of iron (see Fig. 257). The lattice of 


Hg 


Fig. 263 


iron is cubic; iron atoms are located at the corners and centres of 
cubes. Lithium, potassium, caesium and a number of other sub- 
stances possess such a structure. 

In Fig. 263, the structure of crystalline mercury is compared with 
an ideally cubic compact packing arrangement. It can be seen that 
the nature of the arrangement of atom centres is the same in both 
cases, but in the former the distance between layers is less, and the 
distance between atoms in a layer is more than in the latter. This 
is analogous to a compact packing of slightly flattened spheres. 

Many examples exist of such more or less “damaged” compact 
packing structures. In the case of ice (see Fig. 264), all resemblance 
to a spherical packing arrangement is lost. The link between each 
pair of oxygen atoms is implemented by a hydrogen atom. In these 
four bonds, there are two oxygen atoms per hydrogen atom. (The 
structure shown in Fig. 264 does not, of course, contradict the chem- 
ical formula for water.) For purposes of clarity, a “hydrogen” bond 
is shown in the figure as a “neck”. The structure of ice is very loose, 
as indicated by the large “holes”. If one projects the structure above 
the plane of the figure, these holes are transformed into broad chan- 
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nels which pass through the structure. The structure of ice is an 
important exception to the general rule. This does not mean that 
the cases in which the likening of a crystal to a compact packingof 
particles is meaningless are rare. P 
As indicated above, crystals 
formed of atoms bound by com- 
mon electrons cannot be likened 
to a compact packing of spheres. 
The structure of zinc sulphide, 
which is illustrated in Fig. 257, 
is quite typical. Moreover, sev- 
eral elements reveal a similar 
structure. These include carbon 
(diamond), silicon, germanium 
and tin (white). 
Homopolar bonds can form 
layers and chains of atoms. 
Fig. 265 shows the structure 
of graphite. The carbon atoms in 
graphite form a layered structure, 
but these layers are not the same 
as those of a compact packing arrangement. It is not possible to 


form a layer of graphite of contiguous spheres. In graphites, layers 
of strongly bound atoms constitute planes. Arsenic and phosphorus 
also form layered structures in this 
sense, but the atoms of their layers 
are not arranged in a single plane. 
An example ofa structure consist- 
ing of chains of strongly bound 
atoms is provided by gray sele- 
nium. Each atom of this sub- 
stance is strongly bound to only 
two neighbours. In gray selenium, 
the atoms form an endless spiral 
about a straight line. The separa- 
tion between atoms of neighbour- 
ing spirals is considerably greater 
than the separation between close 
f P atoms in a given spiral. 
Fig. 265 The black, lustreless, soft graph- 
ite used in pencils and the shiny, 
d which can cut glass are composed of the 
same kind of atoms, i.e., carbon. This is a very striking example 
of how greatly the properties of a crystal are affected by the arran- 
gement of its atoms. Refractory crucibles capable of withstanding 


Fig. 264 
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temperatures of 2,000-3,000° C are made of graphite, while at temper- 
atures exceeding 700° diamonds are burned up; graphite has a spe- 
cific weight of 2.4 as compared with 3.5 for diamond; graphite 
conducts electric current, but diamond does not; etc. 

The ability to form different crystals is not a property of carbon 
alone. Almost every chemical element in the crystal state and 
every substance has several forms. Six forms of ice are known, nine 
of sulphur, and four of iron. 

At room temperature, atoms of iron form cubic lattices, the atoms 
being located at the corners and centres of cubes. Thus, each atom 
has eight neighbours. At high temperatures, iron atoms form a com- 
pact structure. Here, each atom has twelve neighbours. Iron possess- 
ing eight neighbours is soft, while iron possessing twelve is hard. 
The quenching of steel fixes, at room temperature, a compact cubic 
structure which is stable at higher temperatures. 

We have seen in the cases of carbon and iron that the structures 
of different crystals of one and the same substance may differ consid- 
erably from one another. The same holds true for other substances. 

Thus, for example, in a crystal, yellow sulphur forms corrugated 
rings of eight atoms each. Each ring constitutes a sulphur molecule. 
Red sulphur also consists of such rings but they are turned com- 
pletely differently with respect to one another. 

Yellow phosphorus atoms form cubic structures with eight close 
neighbours. Black phosphorus has a layered structure similar to 
graphite. 

Gray tin has a structure similar to that of a diamond. Theoreti- 
cally, white tin could be obtained from gray tin by strongly com- 
pressing the diamond-like structure along the axis of a cube. As 
a result of this flattening process, a tin atom would have six close 
neighbours instead of four. 

Organic substances also frequently have a variety of crystal 
forms. The very same molecules are arranged differently with respect 
to one another. 


234. Thermal Vibrations in a Crystal 


From the viewpoint of energy, an ideal crystal is, in a way, the 
antithesis of an ideal gas. 

In an ideal gas, the interaction energy of particles is much less 
than the average energy of thermal motion k7’. On the other hand, 
since strong coupling exists between particles in a crystal, the inter- 
action energy is much greater than k7. Therefore, thermal motion 
in crystals cannot disrupt the coupling between atoms, but merely 
results in small vibrations of the atoms about equilibrium positions- 
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In a crystal, every atom vibrates about an equilibrium position. 
For most crystals, the vibration amplitudes are of the order of 
0.4 A, i.e., a small fraction of the distance between close atoms, 


which, as we know, is of the order of 1.5-2 A. 

The nature of this vibration may be very complex. During a vi- 
bration period, an atom describes a complex curve about its equilib- 
rium position. This is due to the fact that an atom is bound to its 
neighbours by different forces; hence, its vibrations are of an ani- 
sotropic nature. In any case, it is always possible to resolve the 
vibrations of an atom along three axes. Evidently, the atoms of 
a crystal will have 3N degrees of freedom, where NV is the number 
of atoms. ` 

If the molecules of a crystal are clearly distinguishable, it is 
meaningful to speak of vibrations of a molecule and vibrations of 
atoms within a molecule. Since the coupling between molecules is 
considerably weaker than between atoms, the frequencies of their 
vibrations will be less. In molecular crystals, the motion of a molecule 
as a whole is of decisive importance. The vibrations of a molecule 
about its equilibrium position are of a translational as well as tor- 
sional nature. Apparently, it is even possible in rare cases for total 
rotation of molecules about centre of gravity to occur. For example, 
such rotation of molecules probably occurs in the case of solid 
methane (CH,). 

The total energy of a vibrating particle consists of its potential 
energy and its kinetic energy. The average values of these two ener- 
gies over a period of vibration are equal to each other. As is known, 


i : 3 
the average kinetic energy of an atom in a gas is equal to z kT. It 


would seem reasonable to assume that, at the same temperature, 
a vibrating atom possessing twice the average energy possesses 3 kT 
units of thermal energy. Then, one mole of crystalline substance 
should have an energy 3RT and a molar heat capacity C, = 3 R ~ 
æ 6 cal/mole. : 
At high temperatures, this formula is in very close agreement with 
experimental results. The temperature dependence of the heat capac- 
ity of crystalline bodies is shown in Fig. 266. Beginning at zero, 
the heat capacity increases, attains a value of 6 cal/mole at a certain 
temperature, and then remains unchanged. The ratio of eee 
ture to a constant 0 (to be discussed in the following article) is plot- 


ted along the a-axis. ON 

At high eS eae the value of kT’ is significantly greater than 
the difference between vibrational energy levels; hence, the en 
nature of distribution of vibrating atoms according to- energy levels 
does not affect the value of the average vibrational energy. Under 
such conditions, the simplified method of calculating the average 
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energy is quite justified. This is confirmed by exact calculations 
which take into account the distribution of atoms according to ener- 
gy as given by the Boltzmann law. 

When &7 becomes comparable to the difference between energy 
levels, the Boltzmann law is no longer applicable and must be re- 
placed by a quantum distribution law (see p. 684). Calculations indi- 
cate that heat capacity decreases with decreasing temperature. We 
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Fig. 266 


shall not present these calculations; we note, however, that quali- 
tatively a decrease in C, with decreasing temperature is quite under- 
standable. The smaller the magnitude of kT, the smaller the number 
of energy transitions that may occur in a system. This means that 
the possibility of thermal exchange decreases with decreasing kT 
and approaches zero as a limit. The limiting case can be explained 
as follows: Energy cannot be transmitted to a body in extremely 
small bursts k7' even if there are an infinite number of such “bursts” 
with a very high total energy. Energy cannot be transmitted since 
a single “burst” does not suffice to transfer a system from its zero- 
energy level to the next one. 

Experiments indicate that energy transitions corresponding to 
a change in the state of molecular motion in a crystal lie in the region 
of long infrared waves. 

For purposes of guidance, let us assume that such transitions cor- 
respond to a wavelength of 1 mm. 
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Let us compare several values of kT with the quantum energy cor- 
responding to a wavelength of 1 mm. For this wavelength, v = 
= 3 x 10! sec}, i.e., kv = 200 x 10-” erg. 


kT (ergs) 


Temperature (°K) (Approx. values) 
500 7,000 10-17 
100 4380x1071? 
10 138 10-17 
1 14x 10-17 


It can be seen that at 100°K the thermal vibration energy still 
considerably exceeds the difference between the energy levels of 
a molecule in a crystal. At 10°K it has the same order of magnitude 
and at 14°K thermal vibrations are unable to produce transitions 
from-one level to another. 


235. Thermal Waves 


An interesting feature of thermal vibrations in a crystal is their 
occurrence in the form of thermal waves. Therefore, atomic vibra- 
tions cannot occur independently of one another. An atom deviating 
from its equilibrium position pulls along the next one. 

Since a crystal is a finite body, standing waves are formed within 
it. As in the case of all natural oscillations, the maximum length of 
a standing wave equals twice the dimension of the body. The bound- 
aries of a crystal must correspond to standing-wave nodes. 

On p. 135 we discussed elastic vibrations of solids considered as. 
a continuum. It was shown that there arise in a finite solid body 
numerous standing waves of different direction and frequency. The 
picture becomes considerably more complex when the atomic struc- 
ture of the solid is taken into account. A theoretical investigation of 
the possible vibrational motion of atoms in a monocrystal indicates 
that thermal motion in a crystal can be represented as the result 
of the superposition of 3sV waves, where N is the number of cells. 
and s the number of atoms per cell. The number of possible waves 
is equal to the number of degrées of freedom of the system of atoms. 
forming the crystal. How do these waves arise and how are they 


manifested? 5 ; : 
Let us restrict ourselves to a consideration of a chain of atoms, 


ie., a “unidimensional crystal”. Fig. 267 shows ‘such an atomic chain, 
the “cells” of which.consist of two atoms, denoted by black and white: 
dots. Since the actual thermal motion of atoms in a crystal is of 
a very complex nature, the figure has been simplified to show the: 
“elementary” waves into which this motion can be resolved. Calcula- 
tions indicate that the resulting vibration can always be represented 
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as the sum of harmonic vibrations. Like in the case of a solid rod, 
a series of waves of various wavelengths arise in a unidimensional 
crystal. If a chain consists of a thousand cells with period a, there 


arise V waves of wavelength 2,000 a, 1,000 a, Z008 a, 500 a, 400 a, 


3 
etc. The shortest wavelength is 2 a. 

But this is not all. Each of the possible wavelengths occurs in s 
variations. Two types of waves of the same wavelength are shown in 
the figure. A case exists in which the lattice of atoms vibrates as 
a whole. Such a wave is called an acoustical wave. The remaining 
s — 1 waves are quite different. In 
these cases, different types of atoms 
execute complex motion with respect 
to one another and at each instant 
only atoms of a single type fall on 
the sinusoid. There are s — 4 such 
vibrations, which are called opti- 
cal vibrations. 

The figure shows waves corre- 
sponding to. atomic vibrations in 
a single direction. 

Atomic vibrations can always 
be resolved into two transverse 
components and one longitudinal 
? ; component. Therefore, a wave of 
given wavelength travelling in a given direction will have 3 compo- 
nents of the acoustical type and 3 (s — 1) of the optical type. Of the 
total of 8sV waves, 3N—two transverse and one longitudinal for 
each wavelength in each direction—will be acoustical. This is com- 
pletely valid for three-dimensional crystals. 

Although the wavelengths and frequencies of the waves have dis- 
crete values, we may utilise the results obtained on p. 136 and 
express, approximately, the number of acoustical vibrations having 
frequencies less than v as 


Fig. 267 


40 3 

Hees 
Here, v is the volume of the crystal and c the velocity of the wave. 
The velocities of the longitudinal and transverse waves are differ- 
ent. Therefore, the total number of waves, which is equal to 3N, 
should be written as follows: 


åw í 2 
4 ata) Vmax = 3N, 


3 


where c; is the longitudinal velocity and c; the transverse velocity 
of the wave. Whence, the value of the maximum frequency of vibra- 
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tion, Vmax, can be easily found. The corresponding wavelength, 


Cr Cy 


mn Vmax Vmax’ 
is, as it should be, of the same order of magnitude as a cell period. 

If the velocity of propagation of acoustical waves in a crystal is 
known, we can calculate Vmax, the value of which determines to a 
large extent the behaviour of a crystal. 

As was indicated above, heat capacity depends on the commensur- 
ability of the energy ky and the thermal energy kT. If Wmax < kT, 
thermal exchange excites all of the vibrations and waves in a crys- 
tal, i.e., all of the quantum transitions are possible. As a result, 
the quantum nature of thermal exchange is not apparent. Such 


aed hVmax 3 d 
a crystal has a characteristic temperature 0 = A which is much 


less than the temperature of the experiment, i.e., 0 < T. On the oth- 
er hand, if kvmax > kT, i.e., if the characteristic temperature O»T, 
only vibrations of low frequency are excited in the crystal because 
high energy levels cannot be surmounted by the thermal “bursts”. 

Here are examples of the characteristic temperatures (°K) of 
a number of crystals: . 


Pb Benzene Ag NaCl Fe Be Diamond 
90 150 215 280 450 4,000 1,860 


For such substances as lead and benzene, room temperature is “high”. 
This corresponds to the horizontal portion of the heat capacity curve 
(C, = 6 cal; see Fig. 266). On the other hand, room temperature is 
low for beryllium and diamonds. Thermal vibrations of these sub- 
stances are excited to an insignificant extent and their heat capacity 
is considerably less than 6 cal/mole. The maximum vibration fre- 


quencies vmar = can be calculated from the above values of the 


characteristic temperature 0: 


Pb Benzene Ag NaCl 
4.881012 3.131012 4.47x 1012 5.841012 9.31012 20.8x1012 38.8x 1022 


Fe Be Diamond 


It can be seen that the limiting frequencies of thermal vibrations lie, 
as was assumed in the example on p. 607, at the boundary between 


the infrared and radio bands (Amin >> 107? cm). 


236. Thermal Expansion 


How can we explain the fact that the average distance between 
neighbouring atoms increases with increasing temperature? To 
answer this question, let us consider the curve of potential energy of 


39—1409 
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interaction between atoms or molecules (see Fig. 268). Irrespective 
of the peculiarities of the interaction between particles, the poten- 
tial curve is always asymmetrical: in the direction of decreasing 


Fig. 268 


separation between particles 
the curve rises steeply, while 
in the direction of increasing se- 
paration a well wall is formed. 
This is due to the follow- 
ing simple fact: practically, 
the separation between two 
atoms or molecules cannot be 
decreased indefinitely, but it 
can be increased indefinitely — 
at great distances the bond 
between the particles is broken. 

The maximum and minimum 
distance between vibrating 
atoms may be noted on a poten- 
tial curve. The middle of the 
segment connecting the two 
limits corresponds to the aver- 
age position of an atom. When 
the temperature increases from 
T, to T, the energy of a vibra- 
ting particle increases and the 
particle passes over to another 


energy level (see Fig. 268). Since the potential curve is asymmetrical, 
the average position of an atom is displaced to the right. Therefore, 


the average separation be- 
tween atoms will be greater 
than the equilibrium sepa- 
ration between atoms at 
rest, i.e., the r correspond- 
ing to the minimum of the 
potential well. Thermal 
expansion results from the 
fact that the average sepa- 
ration between atoms in- 
creases with temperature. 

The thermal expansion of 
a crystal is anisotropic, i.e., 
in different directions the 


Fig. 269 


coefficient of linear expansion œ has different values. Therefore, 
when indicating the value of «, the crystallographic direction of in- 


terest should be specified. 
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When it is required to give a complete description of the thermal 
expansion of a crystal, an expansion diagram may be used. Such 
a diagram is shown for a naphthalene crystal in Fig. 269; a, b, c 
are the crystal axes and Ay, Arr, Ar the axes of symmetry of the 
expansion diagram. The length of a radius vector drawn from the 
origin to a point on the surface of the diagram gives the value of a 


in the given direction. 

The shape of an expansion diagram and its orientation relative 
to the cell axes accord with the symmetry of the crystal. It cannot 
be otherwise since physical properties must be the same in directions 


connected by symmetry operations. 


In order to determine linear expansion coefficients, we must have at our 
disposal means to measure very small displacements with a high degree of 
accuracy. Instruments for measuring thermal expansion are called dilatometers. 
Interference methods (see p. 358) can provide the required sensitivity (several 
hundredths of a micron or better), but these require perfectly ground speci- 
mens, which are very difficult to prepare. 

In practice, quartz differential dilatometers are used. In such instruments, 
the specimen to be measured is placed in a cylindrical holder made of quartz 
glass. At the bottom of the holder a base prism is placed. One end of the speci- 
men rests on this prism. A quartz rod, which transmits the expansion of the 
specimen to the measuring portion of the instrument, rests on the upper end 
of the specimen. The displacement of the end of the rod is measured by means 
of a microscope or rotary mirror. Since the holder as well as the specimen ex- 
pands upon heating, the instrument registers the difference between the coef- 
ficients of expansion of the specimen and the holder. The necessary corrections, 
based on available data on the thermal expansion of quartz over a wide range 
of temperatures, may then be introduced. 

The most accurate method of obtaining an expansion diagram is by means 
of X-ray structural analysis—measurement of the displacements of diffraction 
spots. 
Coefficients of Linear Expansion 


tec ax104, deg-1 
Aluminium . o. 0-100 0.238 
Gypsum’. paS + - =. 12-25 0.025 
Quartz, || to the axis. . 40 0.0781 
Quartz, | to the axis 40 0.1419 
| Eee GS OLS —10-0 0.507 


237. Crystal Imperfections 


Block Structure. The structure of an actual crystal differs consid- 
of an ideal space lattice. This assertion is based on 
including direct electron-microscopic observations. 


39* 


erably from that 
numerous facts, 
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However, our basic knowledge of inner crystal imperfections is de- 
rived, in the first place, from strength measurements. A crystal will 
rupture when it is subjected to a stress of less than a hundredth of 
the stress which an ideal object should withstand. Deformations and 
the rupture of crystals will be discussed in Chapter XXXIV. 
Here, we shall summarise our present knowledge of crystal imper 
fections. 

Briefly, the situation can be described as follows: A monocrystal 
does not constitute a single lattice. It consists of a large number of 
tiny blocks which are slightly misaligned (within the limits of 
several seconds or minutes) with respect to one another. The dimen- 
sions of the blocks may vary within rather broad limits. In most cases, 
they lie in the range of 1076- 
40-4 cm. Plotting of the block 
dimensions of a crystal would 
probably yield a characteristic 
distribution curve. 

Of great interest is the ar- 
rangement of particles at the 
boundary between two blocks. 
There is good reason to assume 
that a liquid surface covered 
with soap bubbles can serve 
as an excellent model. 

: An examination of Fig. 270 

Fig. 270 _ shows that a“fracture” exists in 

the atomic rows close to the 

centre of the model. The portion of the “structure” illustrated in 

the figure can be represented as four blocks with a common corner at 

the centre of the model. An imperfection is clearly visible at the cen- 

tre. Here, the upper “atoms” have not fallen into their right places, 

i.e., they do not fit into the empty spaces of the compact packing 

structure. This fault has led to the splitting up of the crystal into 
blocks. j 

Many such faults—called dislocations—exist in a crystal. They 
are distributed randomly and may turn an atomic row to the left 
or to the right. Therefore, on the average, all crystallographic align- 
ments extend through an entire monocrystal with great exactitude. 
Dislocations give a monocrystal a block (or mosaic) structure. To be 
sure, the presence of chance microfissures, or empty spaces, several 
atoms deep, also facilitate the formation of block structures. 

An examination of the figure shows that a dislocation may also 
be pictured as follows: two adjacent rows, one of which has one par- 
ticle more than the other—an extra atom has “found its way” into 
one of the rows. 
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Dislocations. Fig. 270 represents a two-dimensional model of 
a crystal. It is as if each row were the projection of an atomic layer 
which is oriented perpendicular to the figure. The large fault would 
correspond in a three-dimensional crystal to a linear region perpendic- 
ua the figure. This region may be called the core of the dislo- 
cation. 

Dislocation patterns not only explain the block structure of crys- 
tals, but also many other phenomena. Therefore, it is well to study 
these peculiar distortions of crystals 
in detail. 

There are two kinds of dislocations— 
simple and spiral. The dislocation illus- 
trated by the bubble model is of the 
simple kind. Schematically, such a 
dislocation is illustrated in Fig. 271a. 
The core of the dislocation is designated 
by an inverted T. The distortion is 
maximal near the dislocation plane 
dividing the crystal into two parts 
and rapidly diminishes in either direc- 
tion away from the dislocation line. 
Fig. 271b shows a top view of the two 
adjacent atomic planes on either side 
of the boundary between the blocks. 
The upper, or compressed, plane (des- 
ignated by solid lines) contains one 
row more than the lower one (desig- 
nated by dotted lines). 

Analogous diagrams for a so-called Fig. 274 
spiral dislocation is shown in Fig. 272. 

The lattice is divided into two blocks, one of which has slipped, 
so to speak, a distance of one period relative to the other (see 
Fig. 272a). At the axis shown in the figure, the distortion is max- 
imal. The region adjacent to this axis is called a region of spiral 
dislocation. It will be easier to grasp the essence of this distortion 
if we examine the diagram of the adjacent atomic planes on either 
side of the boundary between blocks (see Fig. 272b). This is the right 
view of the three-dimensional figure. The spiral dislocation axis is 
the same asin the three-dimensional figure. Solid lines denote the 
plane of the right block and dotted lines those of the left. As may 
be seen from the diagram, a spiral dislocation differs from @ simple 
dislocation. There is no extra row of atoms in this case. The distor- 
tion consists in the fact that the atomic rows change their closest 
neighbours near the dislocation axis, i.e., they bend and drop down 


one level. 
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Why is this called a spiral dislocation? This can be explained as 
follows. Let us move around the dislocation axis—along the nodal 
planes of the lattice—beginning at the lowest plane. After each rev- 
olution, we are one level higher. In this manner, we reach the top 
of the crystal, much the same as having climbed a spiral staircase. 


p 


In Fig. 272, the spiral motion would be counterclockwise. If the 
blocks were displaced in the opposite direction, the spiral motion 
would be clockwise. 

In a given object, one may encounter successive spiral dislocations 
having the same rotation direction. If two dislocations having 
different rotation directions are in the same plane, the resulting 
distortion is more complex. 


Imperfections within a Block. A crystal lattice may consist of 


blocks which also have imperfections. These imperfections may be 
in the form of lattice vacancies or foreign atoms. A very small num- 
ber of lattice vacancies and foreign atoms can result in considerable 
distortion. 

Fig. 273 shows the nature of these distortions. In (a) we see the 
effect of a foreign atom that replaces one of the basic atoms of a lat- 
tice, in (b) the effect of a foreign atom penetrating between basic 
atoms, and in (c) the effect of a lattice “vacancy”. The disturbance 
may extend to a depth of 5-410 lattice spacings in each direction. 

_ This encompasses a region of the order of 1,000 cells. Therefore, an 
impurity of the order of 0.4 per cent may fundamentally change 
the properties of a crystalline substance. (It should be noted, how- 
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ever, thal an impurity does not produce appreciable lattice distor- 
tion.) In Sec. 272 we shall deal with the case of semiconductors in 
which impurities of the order of one 
part in a thousand million may change 
the electrical properties of a body. 


238. Short-Range Order. Liquids 


Tt was seen at the beginning of this 
chapter that very many solids can be 
represented as compact packings of 
spheres. In such structures, the fraction 
of the total volume consisting of empty 
space is equal to 26 per cent. 

Copper is an example of such a 
crystal. How does the structure of a 
piece of copper change when it is 
melted? Experiments show that the 
volume increases by about 3 per cent. 
This increase is due to an increase in 
empty space, which now equals 29 per 
cent of the total volume instead of 
26 per cent. The compact structure 
has loosened and the spheres are able 
to move away from their “proper” posi- 
tions. The ideal order which is charac- 
teristic of a crystal has been dis- 
turbed. 

As a result of thermal motion, the 
spheres vibrate, in general, about their 
equilibrium positions and remain sur- 
rounded by the same neighbours. Now 
and then, neighbours may change when 
a space of the same size as the volume 
of a sphere happens to be created in 
the vicinity of the sphere. Owing to 
the closeness of particles in a liquid, 
a so-called short-range order arises. In 
a model of spheres, one sphere cannot 
approach another closer than the diam- 
eter of a sphere. Such a deviation 
from ideal randomness occurs in gases 
as well, but it is of little importance 
there since the closest molecules in a gas are separated, on the average, 
by a distance ten times as great as the dimensions of a molecule. 


Fig. 273 


wa 
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Let us examine a molecule in a liquid and imagine two concen- 
tric spheres about it. Assume the radius of one is equal to that of the 
molecule and the radius of the other to three times this value. On the 
average, how many close neighbours does such a molecule have? By 
close neighbours we mean molecules located in the region between 
concentric spheres. Consider the example of copper, the volume of 
which increases by 3 per cent when it is melted. According to cal- 
culations there are, on the average, 11.6 atoms in the region under 
consideration. Thus, there are to be found about 42 close neighbours 

the centres of which are separated from 


- Ur) s the given atom by a distance equal to its 
2 diameter. Closer neighbours are not to be 
found. 
1 


It is clear that the short-range order 

affects not only close neighbours but suc- 

2 4 6 & 10 A Cessive ones as well. Therefore, it is cus- 
tomary to describe the short-range order 

Fig. 274 by the average density of radial distri- 

bution of atoms. 

Let us imagine two concentric spheres of radius r and r + dr 
about an atom. For simplicity, assume we are dealing with a mona- 
tomic liquid. The volume of this spherical shell will be 4nr? dr. The 
number of atoms falling within this shell may be expressed as 


U(r) x 4ar? dr, 


where U(r) is the density of radial distribution of Mone 

A U(r) curve for amorphous arsenic is shown in Fig. 274. Maxima 
of tbe curve indicate that certain interatomic spacings acquire con- 
siderably more “weight” than others. The origin of successive maxima 
is exactly the same as of the first. The density of packing is such as 
to allow the number of closest neighbours of a given atom to fluc- 
tuate only within very narrow limits, while the number of neigh- 
bours closest to the closest may fluctuate within somewhat broader 
limits. As the distance from the central sphere increases the random 
order becomes more and more evident. The U(r) curve approaches 
a limit, i.e., the short-range order fades away and gradually passes 
over into a random order. It is convenient to set the value of U(r) 
equal to unity at r— oo. This distinctive order with regard to close 
neighbours, which fades away as the distance from the atom or 
molecule under consideration increases, is what is meant by a short- 
range order. 

The order in the arrangement of particles characteristic only of 
crystals is called a long-range order. This means that the three-di- 
mensional periodicity peculiar to a crystal does not fade away at 
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great distances. The arrangement of atoms along a nodal line regu- 
larly repeats itself thousands and millions of times. 

When we discussed the structure of crystals, we saw that atoms by 
no means always behave like spheres. This applies to liquid struc- 
lures as well. 

In an ideal case, the short-range order in an atomic liquid should 
result in the number of close neighbours being equal to almost 
twelve. Experiments show that metals the crystal structures of which 
consist of compact arrangements of spheres continue to have such 
a short-range order, i.e., an average number of close neighbours 
just short of twelve, after being melted. 

As indicated above, every atom of lithium, sodium and potassium 
in a crystal has eight close neighbours. The same short-range order 
is preserved in a liquid, but the average number of close atoms be- 
comes somewhat greater than eight. 

Simple substances in the crystalline state of which the atoms are 
firmly bound to a small number of neighbours behave differently. 
These bonds are broken when such substances are melted and the 
number of close neighbours per atom of fluid becomes greater than 


in a crystal of the same substance. 


239. Amorphous Bodies 


The word “amorphous” means “without form”. Amorphous solids 
are the antitheses of regular polyhedral crystals. However, the shape 
of a polycrystalline body is not regular even though it is not amor- 
phous. How then may crystals and crystalline bodies be recognised? 
They may be recognised, primarily, by their well-defined melting 
points. If heat is applied to a crystalline body, the temperature of 
the body increases until it begins to melt. Thereupon the tem- 
perature ceases to rise and the entire melting process takes place at, 
the melting temperature. 

Ordinary glass is a typic 
it is heated and gradually goes over 


perature is raised. 
This behaviour’of amorphous bodies can be explained by their 


structural peculiarities, leading to the classification of such bodies 
as liquids rather than solids. 
As indicated, there exists a lon 
line bodies. In amorphous bodies, 
occurs and in this respect such 
Fig.. 275a shows the structure 
Fig, 275b the structure of quartz gl 
a substance can be obtained in a cryst 
amorphous form. The similarities and differe 


al amorphous solid. It grows soft when 
into the liquid state as the tem- 


g-range order of particles in crystal- 
only a short-range order of particles 
bodies do not differ from liquids. 
of quartz (silicon dioxide), and 
ass. From the chemical viewpoint, 
alline form as well as in an 
nces between these two 


or 
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states can be clearly seen in the figure. Apparently, an amorphous 
body is a “damaged” crystal. In a crystal and an amorphous body, 
the number of close neighbours and the nature of the encirclement 
are the same. Possibly, a pentagonal ring is particularly advanta- 
geous energetically for SiO, groups. Since the symmetry of an axis of 
fifth order cannot produce a periodic structure (see p. 592), amor- 
phous glass is obtained. 

The absence of a long-range order, which is characteristic of crys- 
talline bodies, is the immediate cause for the absence of a well-de- 


fined melting point. At the melting point, a transition occurs and 
the long-range order disappears. Only a short-range order in the 
arrangement of atoms remains. 

In amorphous bodies, the nature of the arrangement of atoms does 
not change when the temperature is increased. Only the mobility 
of the atoms changes, i.e., the atomic vibrations increase. At first 
only a few atoms are able to escape from their encirclement and 
change neighbours. This number gradually increases until, finally, 
the rate of such changes becomes the same as in water. 

The ease with which a given molecule may change its neighbours 
is related to an important property of a liquid, namely, its viscosity. 
The less frequently neighbours are changed in a liquid, the thicker, 
i.e., the more viscous, the liquid. Of course, an increase in tempera- 
ture, which increases the swing of molecular vibrations, results in 
a decrease in viscosity. It is also quite understandable that, tempera- 
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ture conditions being equal, the liquid whose molecules are more 
complex will be more viscous. Many liquids harden before they be- 
come very viscous. The high viscosity of glue, honey, tar and oil is 
due to the complex form of their molecules. 

When aliquid hardens, the exchange of molecules practically ceases. 


240. Short- and Long-Range Order of Atoms in Alloys 


When two or more substances crystallise together, they may in 
certain cases form a common crystal lattice. It depends on the relative 
values of the energy of interaction between homogeneous and heter- 
ogeneous particles whether such a mixed crystal is formed or not. 


If the attraction between homogeneous particles is greater than be- 
tween heterogeneous particles, a mixed crystal is not. formed. 

Metal alloys, which are widely used in various industries, are 
mixed crystals. By reference to the structure of alloys, we can 
clarify the concepts of short- and long-range order. i 

In the simple case of diatomic alloys, we may encounter perfectly 
ordered structures in which a definite cell can be distinguished and 
the substance described as a crystal of a compound with the definite 
formula AnBm. However, this does not always occur and in a num- 
ber of cases A atoms randomly replace B atoms in their lattice or, 
if they are small, randomly become lodged between B atoms, 

We shall discuss only a substitution alloy, namely, iron-cobalt 
(see Fig. 276). This alloy has a simple body-centred lattice structure. 
Each atom—iron as well as cobalt—has eight close neighbours. As 
regards the arrangement of atom centres, an alloy crystal is always 
perfectly ordered, i.e., the atom centres form the same body-cen- 
tred lattice under all conditions. The situation differs with respect 
to the distribution of iron and cobalt atoms.. Let us consider the 
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cell points of a crystal to be divided into corners and centres of cubes. 
For perfect order, all corners are occupied, say, by iron atoms and 
all centres by cobalt atoms (Fig. 276a). The ideal long-range order 
of such a crystal may gradually deteriorate if atoms begin to occupy 
“foreign” sites. But as long as the number of atoms located at their 
“own” sites differs from the number of atoms located at “foreign 
sites (Fig. 276b), the crystal may be said to have a long-range order, 
even though the order is, in part, “impaired”. The long-range order 
disappears when the distinction between “foreign” and “own” sites 
is lost, i.e., when half the atoms are at their “own” sites and half 
at “foreign” sites (Fig. 276c). 

It is important to note that when a completely ordered crystal 
is heated, the order is gradually destroyed, i.e., the percentage of 
atoms at “foreign” sites increases. There exists a temperature above 
which even a partially “impaired” long-range order cannot exist. 
This temperature is called the -point (lambda-point). For an iron- 
cobalt alloy, the A-point is 770°. The transition from order to dis- 
order signifies that thermal motion has gained the upper hand over 
the “tendency” of atoms to maintain a long-range order. 

There is a great deal in common between the process of eliminat- 
ing the distinction between “foreign” and “own” sites and the melting 
process. Both processes result in the disappearance of a long-range 
order. However, melting results in the disappearance of the long- 
range order of atom centres, while passing through the A-point 
results in the disappearance only of the order in the arrangement 
of atoms of different elements. 

The basic characteristic of the structure of alloys of the iron- 
cobalt type is the possible existence of a partial long-range order. 
Such a partial long-range order can exist only with respect to the 
distribution of the iron or cobalt atoms, but not with respect to 
the arrangement of atom centres. 

Like in the case of melting, the elimination of a long-range order 
does not mean the elimination of order in general (a short-range 
order remains). 

The short-range order with respect to the distribution of atoms in 
iron-cobalt crystals consists in the “tendency” of cobalt atoms to 
surround themselves with iron atoms, and vice versa. If we take any 
atom and examine its eight close neighbours, we will find that the 
number of atoms of the other element will not be equal to one half 
of the total, i.e., four. Depending on the degree of perfection of 
the short-range order, an iron atom may be surrounded, on the aver- 
age, by five, six or seven cobalt atoms. 

The investigation of a copper-gold alloy shows that its short-range 
order has a high degree of perfection. This pertains not only to the 
number of closest neighbours, but to the number of closest to thé 
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closest, etc. If a number of spheres are drawn about any of the gold 
atoms, it is found that the first shell contains, in effect, only copper 
atoms, while the second contains only gold atoms. In successive 
shells, the short-range order begins to deteriorate, but a predilec- 
Lion for atoms of a definite element will be felt as far away as the 
tenth shell! 

It has been determined by means of very precise investigations 
using X-rays how a long-range order in alloy crystals is “created”. 
Experiments with cobalt-platinum alloys have shown that the re- 
gions of long-range order grow in a disordered crystal as crystal nu- 
clei grow in a liquid. These embryonic regions are arranged in a per- 
fectly definite manne® relative to the axes of a crystal. 


241. Liquid Crystals 


To find examples of liquid crystals, one must turn to organic sub- 
stances. Molecules of substances forming liquid crystals are always 
elongated. Liquid crystals are encountered among viruses, and also 
among li oids, which are components of living tissue. 
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Fig. 277 


A substance forming liquid crystals exists as such in a definite 
temperature range- If a liquid crystal is heated, it turns into an 
ordinary liquid; if it is cooled, it becomes a crystal. 

The term “liquid crystal” is derived from the strange manner in 
which the properties of a liquid and a crystal are combined. A liq- 
uid crystal possesses fluidity and forms drops. However, these drops 
may be elongated rather than spherical, and somewhat resemble jel- 
ly. Careful investigation shows that the order of molecules in such 


a drop is not like in ordinary liquids. 
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Two kinds of liquid crystals are known. In one, the molecules are 
in a short-range order and parallel to one another. In the other, the 
order of molecules is even more peculiar. Here, the molecules are 
arranged in layers. Each layer consists of parallel molecules, which 

are in a short-range order. This is illus- 
‘trated in Fig. 277. 

$ A soap solution consists of liquid crys- 
tals. The washing properties of soap are 
directly related to its ability to form 
| I Il l L liquid crystals. A moleclue of soap has an 
elongated shape (transverse axis measur- 
ing about 4 Å and longitudinal axis 
30-40 A). At one end of a molecule a 
negative electric charge is concentrated 
and this pole is attracted by water mole- 
cules. A soap solution is a liquid crystal. 
It consists of a large number of double 
layers of molecules, separated by layers 
of water (see Fig. 278). In the double 
layers, the poles of the molecules are 
turned outward, i.e., toward the water. 
The molecules of soap within a layer are 
in a close arrangement and ina short-range 
order. If there is little soap in the water, 
the double layers of soap molecules are 
separated by large layers of water, As 
more soap is put into the water, more and 
more double layers will be produced. The solution becomes saturated 
when the thickness of a water layer equals about 20 A. The double 
layers forming a liquid crystal possess great mobility. When we 
wash our hands, the layers slide easily relative to one another and 
the skin. Dirt from our hands collects at the poles of the molecules 

and is then released in the water. 


242. Polymers 


A large number of organic substances, composed of giant molecules 
consisting of thousands of atoms, have a peculiar structure. Such 
substances include plastics, kapron and artificial silk. The molecules 
of these substances consist of identical groups of atoms arranged 
in a chain (whence the term “polymer”). The atoms within a mole- 
cule are frequently in a long-range order. 

The properties of high-polymers having lateral chemical bonds 
between chains differ significantly from those of so-called linear 


l 
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polymers, in which such bonds do not exist. Polymers having later- 
al chemical bonds between chains are rigid systems. Their atoms 
are arranged rather loosely and in a completely haphazard manner. 
Plastics of such polymers are used in the manufacture of buttons, 
kitchen utensils and various fittings. Linear polymers have interest- 
ing properties and structures. 


Although a number of points still 4 
remain unclear, the basic struc- Lf Ly bg by Ly 
tural features of these substances CEE yy 
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The role of a stack in a linear i b) 
polymer is somewhat analogous to 
the role of a crystalline particle 
in a polycrystalline substance. og 
Nevertheless, there is a signifi- IN Gl Ca) 
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of parallel chains forming a stack 

(consisting of thousands or tens of c) 
thousands of such chains) may vary 

considerably from case to case. Fig. 279 
Fig. 279shows three kinds of order: 
a) crystal type—the axes of the chains form a perfect lattice and the 
azimuths of the chains are ordered; b) gas-crystal type—the axes of 
the chains forma lattice and the azimuths are disordered; and 
c) amorphous or liquid type—no lattice is formed (absence of long- 
range order). It should be noted that the displacements of the chains 
relative to one another in the longitudinal direction also may be 
ordered or disordered. Since a stack of chains, each of which con- 
sists of hundreds of thousands or millions of atoms, is very long 
and will constitute, therefore, a broken formation, it is evident that 
even in the case of ideal order in the arrangement of chains the order 
of a stack is not entirely like that of a crystal. Only the portion of 
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a stack for which the parallel chains are rectilinear can have a crystal 
arrangement. This means that in the case of ideal order each stack 
of chains consists of a sequence of crystalline regions (crystalline 
particles). 

The stretching of linear polymers consists in the unfolding of 
stacks of chains. A similar mechanism of elongation enables us to 
explain extensions of up to 1,000 per cent occurring in certain nal- 
ural and artificial high polymers. 

Rubber and polyethylene are the best known linear polymers. 
The high polymers used in the manufacture of artificial fibre also 
have the same kind of structure. 


CHAPTER XXXIII 
PHASE TRANSFORMATIONS 


243. Phase Diagrams 


A substance can have only one gas state and one liquid state, but 
it can have several crystal states (also, several liquid-crystal and 
gas-crystal states). 

The gas and liquid states of a substance are characterised by disor- 
der in the arrangement of particles. Ina gas, the ratio of the kinetic 
energy of the particles to their potential energy of interaction is such 
that the binding forces cannot yestrain the particles from flying apart 
to the extent to which the vessel containing the gas permits. The 
liquid state has a definite form since the binding forces do not permit 
the molecules to have independent free paths.* At high pressures, 
the distinction between a gas and a liquid disappears. 

Since two basically different random arrangements of particles 
do not exist, every substance has one liquid and one gas state. A crys- 
tal is characterised by a definite arrangement of particles and, in 
principle, an infinite number of different crystal phases can exist 
for a given substance. In actuality, several different crystal phases 
exist, as a rule, for one and the same chemical compound (diamond 
and graphite, white and gray tin, yellow and red sulphur, etc.). 

Every substance exists in one or another phase, depending on the 
external conditions, viz., the temperature and the pressure. It is 
customary to use a pressure-temperature diagram, instead of a 
table, to describe the conditions for the existence of the various 
phases of a given substance. A diagram of this kind is known as 
a phase diagram. 

Three such diagrams are shown in Fig. 280. In the upper left-hand 
corner, we see a phase diagram for an ideal substance having only 
one solid phase. The diagram is divided into three regions:. one Te- 
gion indicates the conditions for the existence of a crystal, another 
for the existence of a liquid, and the third for the existence of a gas. 
The gas state, of course, is represented by the lower right-hand por- 
tion of the diagram, i.e., where the temperatures are low and the 
pressures high. The solid phase is represented by the region of lowest 
temperatures and highest pressures. Such a diagram is very conveni- 
ent. In order to determine the state of a body at a pressure p and 


* In the absence of gravitational forces, a drop of liquid is spherical. 
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a temperature T, all we need to do is find this point on the diagram 
and observe in which region it is located. 

In the upper right-hand corner of Fig. 280, the conditions forfthe 
existence of the various phases of sulphur are shown. This substance 
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Fig. 280 


has two crystal phases and therefore the diagram is divided into 
four parts. In the lower part of Fig. 280, the phase diagram for water 
is shown. Since it is difficult to represent such a diagram drawn to 
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linear scale on a single drawing, the pressure in this diagram has 
been plotted logarithmically. It will be seen that ice exists in five 
different phases, which are designated by the Roman numerals 1, Il, 
III, Vand VI. The phase which was originally designated as IV turned 
out to be an error. The less common phases of ice exist at higher 
pressures. 

On a phase diagram, dividing lines between phases, as well as 
points within the various regions of the diagram, have physical sig- 
nificance. At pressures and temperatures corresponding to points on 
the dividing lines, two boundary phases exist simultaneously. In 
the case df water, this would correspond to the condition of ice 
floating on water, with theice not melting and the water not freez- 
ing. The dividing lines may be called phase equilibrium curves. 

It should be emphasised that these are phase equilibrium curves 
and not points. This means that equilibrium between two phases 
can be realised at different temperatures if the pressure is varied 
accordingly. This may also be expressed as follows: the temperature 
of phase equilibrium is a function of pressure; or, the pressure of 
phase equilibrium is a function of temperature. 


244. Phase Transformations 


Phase equilibrium curves may also be called phase transformation 
curves since transition from one phase to another occurs when this. 
line is crossed. 

The dividing line between a solid and a liquid is the fusion, or 
crystallisation, curve; and the dividing line between a liquid and 
a vapour is the yaporisation, or condensation, curve. We call the 
dividing line between a solid and a vapour the sublimation curve and 
the lines between two solid phases simply transformation 
curves. alg ess 

Processes involving a change of state are also conveniently indi- 
cated on a phase diagram. Usually, we are concerned with transfor- 
mations occurring at constant temperature or at constant pressure. 
These processes are aa on a diagram by vertical and hori- 
zontal lines, respectively. 

Se ered phase P raora ODs are illustrated in Fig. 280. Line 
2-1 on the phase diagram of sulphur represents a cooling process for 
a sulphur gas at normal pressure. At a temperature of 444.5°C sul- 
phur is transformed from a gas into a liquid, at 110.2°C from a liq- 
uid into a crystal phase and, finally, at 95.5°C from this crystal 
phase into another crystal phase. The compression of sulphur gas is 
illustrated in the same diagram by the process 3-4. By increasing the 
pressure, we are able in this case too to transform a gas into a liquid 
and then, at very high pressures (above point 4) into solid states. 

40* 
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Under certain unique conditions, three phases may exist together 
simultaneously. Such points are called triple points. Sulphur has 
three triple points: 1) gaseous, liquid and yellow sulphur existing 
simultaneously; 2) gaseous, liquid and red sulphur existing simul- 
taneously; and 3) a liquid existing in equilibrium with the two 
crystal phases. ics 

It can be proved on the basis of strictly thermodynamic principles 
that a quadruple point cannot occur. Thus, no conditions exist under 
which, for example, two crystal phases are in equilibrium with 
a liquid and a vapour. ai 

Every phase transformation is characterised by a transition tem- 
perature at a given pressure. We speak of the melting (crystallisa- 
tion), boiling, sublimation, etc., points of a substance. If the pres- 
sure is not indicated, this usually means that the transformation 
occurs at normal atmospheric pressure. 

An important characteristic of a transformation is its heat of tran- 
sition. The occurrence of latent heat of vaporisation and heat of 
fusion is well known, but heat of transition is a general phenome- 
non. A transformation which proceeds by heating absorbs heat. In 
accordance with the second law of thermodynamics, heat of trans- 
formation is uniquely related to a change in entropy: 


AQ=TAS, 


where T is the transition temperature. Therefore, it is evident that 
a phase transformation which proceeds by heating is accompanied 
by an increase in entropy. 

The transition (fusion, boiling) temperature can be calculated from 
the formula 


T-a 


` 


i.e., it is equal to the quotient of the latent heat of transition divid- 
ed by the increase in entropy. But in this form the statement is of 
purely theoretical significance since practically the change in entro- 
py for a phase transformation cannot be precalculated. However, 
knowing the transition temperature and heat of transition from 


experiments, one- can determine accurately the magnitude. of the 
increase in entropy. = 


The phase transition from ice I to ice III occurs at a temperature t = — 20°C 
and a pressure p = 2,103 atmospheres. This transition is accompanied by the 
release of heat; each gram of ice releases AQ = 5.6 cal. Therefore, the change 
in entropy is 

; AQ 5.6 


mA A y 
AS T= 553 0.022 cal/deg. 
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245. Phase Stability 


How do we explain the fact that under certain conditions a body 
constitutes a liquid and under others a solid? There are two tenden- 
cies which determine the nature of a state under given external con- 
ditions: first, the tendency of a body to possess a minimum of energy 
and, secondly, the tendency to possess a maximum of entropy. The 
first tendency is a consequence of the fact that a system of molecules 
behaves like any system of mass points subject to the laws of New- 
tonian mechanics and, as we know, a mechanical, system tends to 
possess a minimum of potential energy. The second tendency is 
a consequence of the second law of thermodynamics. 

During the transition from a gas to a liquid and during the transi- 
tion to a solid, the internal energy of a substance decreases. The ener- 
gy of a gas is higher than the energy of a liquid since in passing from 
a liquid to a gas work must be expended in overcoming the binding 
forces between molecules. And the energy of a crystal is lower than 
the energy of a liquid since an ordered arrangement of interacting 
particles is always more stable than a disordered arrangement. This 
can be proved rigorously, but the proof will not be presented. The 
statement appears rather obvious. Imagine, for example, a perfect 
lattice of spheres connected by means of springs. Any displacement 
of a sphere requires a certain amount of work. Hence, an ordered 
arrangement corresponds to a minimum of energy. 

Entropy behaves in a different manner. Roughly speaking, the 
freedom of motion of the constituent particles of a body, 
ts entropy. A disturbance in the order or an increase in 
the separation between particles results in an increase in entropy. 

Thus, for a given temperature and pressure, the state of a substance 
is established as a compromise between entropy and energy- Using 
the second law of thermodynamics, we can obtain a quantitative 
expression for this general law. 

Imagine that a body is placed under “foreign” conditions, i.e., 
ice under the conditions for the existence of steam, etc. In such 
a case, an irreversible phase transformation (fusion, vaporisation, 
etc.) will occur in accordance with the second law of thermodynam- 
ics: the increase in the entropy of a body will be greater than the 


applied reduced heat, 


greater the 
the greater i 


aspen 


Using the first law of thermodynamics, we can rewrite the inequali- 


ty in the form 
O a and dU —T dS +pav <0. 
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Since the phase transformation occurs at constant temperature, we 
obtain 


d(U—TS)+pdv<0. 


If the process is not accompanied by a change in volume, the 
transition to a state of equilibrium takes place with d (U — TS) < 
< 0, i.e., with a decrease in the quantity F = U — TS. This func- 
tion is called the free energy. We have shown that a spontaneous phase 
transformation is accompanied by a decrease in free energy, i.e., 
the free energy of a stable state must be a minimum. 

If the process occurs at constant pressure, the transition to an 
equilibrium phase takes place with d (U — TS + pv) <0, i.e., 
with a decrease in the quantity P = U — TS + pv. This function 
is called the thermodynamic potential. Thus, a phase transformation 
at constant pressure is accompanied by a decrease in thermodynam- 
ic potential, i.e., the thermodynamic potential will have a mini- 
mum value at equilibrium. 

The opposing tendencies of entropy and intern 
out in this statement: a decrease in energy 
result in a decrease in free energy or 
These two tendencies have been expres 
relations showing that F and @ tend t 

The formulated condition for 
applications. For example, using 
relation for the slope of a phase equ 

On such a curve, consider two 
conditions 7,, p, and T 
points have the form 


O; (T1, Py) = Dz (74, py) and ®©; (Ta, Po) = O; (To, Po). 
The subscripts of ® refer to the phases in equilibrium. Subtracting 
the first equation from the second, we obtain 
Dy (Tz, Po) —D, (Ti, Py) = Dy (Tz, po) — Dy (Ti, pi). 


Let us assume that the two points are close to each other. Then, 
by means of the formula for the increment of a function of two 
variables, we can transform the above equation into the form 


al energy are brought 
and an increase in entropy 
thermodynamic potential. 
sed quantitatively in the 
o a minimum. 

phase equilibrium has numerous 
this condition, we can derive a 
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points representing the external 
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a, aM, 5 - Dy ADs 
ap af + Jp dp IT dT + ap dp. 


Substituting the values of the derivatives of the function ® = U — 
aD aD vé 
— TS + pv, namely, 37 = — S and Op T% we obtain 
ap: Ss WS 
aT o m= ¥y— 09° 
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But since A = ; 
dp AQ 


af Tava (Clapeyron-Clausius equation). 


Thus, the slope of the curve (the derivative a) is determined 
by the latent heat of fusion AQ, the temperature of the phase transi- 
tion 7, and the difference in the volume of the phases. If AQ is posi- 
tive, this means that the subscript 1 refers to the high-temperature 
phase. 


Let us apply the Clapeyron-Clausius equation to the case of melting ice. 
When ice melts, 1 cm? of water is obtained from 1.091 cm? of ice. The volume 
change vy — və is equal to —0.091 cm? (the volume decreases). In this case, 
AQ will be the heat of melting and is equal to 80 cal/gm. The temperature T. 
equals 273°K. Hence, 


aT T X åv (273) (— 0.091) 0.34 deg cm? 
ap =| AQ * 80 A cal i 


Since the dimensions of the above result somewhat obscure its significance, 
let us convert calories to atmospheres, recalling that 1 cal = 42.7 kg em ~ 
~ 42.7 atm cm’. We obtain 


a _ _,0075 £28. , 
dp atm 


Thus, increasing the pressure by 1 atm decreases the melting point of ice by 
0.0075 degree. 
246. Metastable States 


The above thermodynamic explanation of phase transition phe- 
nomena does not explain a number of observed facts. Thus, from the 
thermodynamic viewpoint, for a given p and T there can occur a single 
state (a point in one of the regions of a phase diagram) for which 
the free energy, Or thermodynamic potential, assumes a minimum 
value. However, it is possible for graphite and diamond to exist side 
by side, and water can be obtained under the conditions for the 
existence of ice (supercooled water). Numerous other examples in 
which the above thermodynamic principles are violated may be cited. 
The situation can be described as follows: In addition to states 
which are stable under given external conditions, so-called meta- 
stable states may also exist. 

The free energy of a metastable state is not a minimum, but nev- 
ertheless the transition from this state to a state having a minimum 
energy is impeded. Different metastable states may differ considera- 
bly in their degree of stability. Sometimes a slight impulse suffices 
for a transition to occur to a “normal” state, while in other cases a 
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a metastable state may be, in actuality, no less stable than “normal” 
state. 

Various phase transformations can be delayed. Thus, water can be 
supercooled, i.e., at normal pressure, water may exist at a tempera- 
ture below 0°C; water can also be superheated, i.e., its temperature 
may be raised above 100°C without boiling. A vapour also may be 
obtained under atypical conditions (a supercooled vapour is said to 
be supersaturated). Transformation delays always occur in the solid 
state, i.e., the transformation of one crystal phase into a-second is 
delayed even though the conditions prevailing are those for the 
stable existence of the second phase. 

However, one type of transformation, namely, fusion, is never 
delayed. Thus, a crystal cannot exist under conditions that are 
stable for the liquid phase. 

We frequently have occasion to deal with supercooled liquids. 
Liquids such as glycerine considerably increase in viscosity when 
supercooled and may remain in the amorphous state for months or 
even years. Glass is another example of a supercooled liquid. 

The existence of a metastable state can be demonstrated in the 
case of a supercooled liquid by bringing the liqùid into contact 
with a crystal. In such cases, crystallisation begins immediately. 

If the liquid is highly supercooled, the effect will be extremely vio- 
lent. When a snowflake is thrown into supercooled water, ice nee- 
dles dart through the water in all directions and in a few seconds 
the transformation is complete. 

; Delays of crystal-crystal transformations are particularly interest- 
ing. Here, delays can occur, so to speak, in both directions. Yellow 
sulphur should be transformed into red sulphur at 95.5°C. If sul- 
phur is rapidly heated, this transformation point may be “skipped” 
and the sulphur may be brought to fusion at a temperature of 113°. 
Now assume that the melt is gradually cooled. At 143° small crystals 
of red sulphur are formed. Cooling does not result in a transformation 
at 95.5°, and even at room temperature small crystals may exist for 
a considerable period of time. However, the transformation process 
proceeds, even though slowly, and within a day is complete, i.e., 
a yellow powder is obtained. Here, too, the metastable nature of 
the state is best demonstrated by dropping a small crystal into 
the melt. 3 

In certain cases, we are interested in a sūbstance in a phase which 
might be expected to exist under entirely different conditions. An 
example of this is white tin which is transformed into gray tin when 
the temperature is reduced to 13°C. Usually, we are more interested 
in white tin and are cognisant of the fact that in winter nothing can 
be done with it. However, white tin excellently withstands 20-30° 
of supercooling and only under severe winter conditions does it 
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begin to be transformed into gray tin. (The members of the Scott 
expedition to the South Pole perished as a result of ignorance of 
this fact. Liquid fuel taken on the expedition had been placed in con- 
tainers soldered with tin. At the extremely low temperatures prevail- 
ing, the white tin was transformed into a gray powder. As a result, 
the containers opened and the fuel was lost.) 

To explain transformation delays, let us consider the difference: 
between liquid-crystal and crystal-crystal transformations on the: 
one hand and crystal-liquid transformations on the other. In the 
last case the long-range order of atoms disappears, while in the 
first two a long-range order is created. The elimination of long- 
range order does not require a great effort. Fusion begins at the sur- 
face; atom after atom is torn away from its neighbours and falls out. 
of strict order. 

On crystallisation, short-range order is transformed into long- 
range order. The process begins at the surface and must proceed in- 
wardly, i.e., into the substance. The atoms, or molecules, are “forced” 
to establish strict order under extremely crowded conditions. Their 
motions must be harmonised for order to be established. As we have 
seen, the rearrangement of atomic order, which requires that atoms: 
undergo “organised” displacements from certain ordered positions- 
- to others, is all the more difficult. 

Transformations in the solid state always begin at the boundaries- 
of grains, blocks, empty spaces, and at dislocations; in other words, 
wherever there is more freedom. If only several score atoms have 
occupied positions corresponding to a new order, oriented growth 
of the nucleus proceeds, i.e., one after another atoms begin to pass 
from the old, less favourable order, or in the case of crystallisation 
from disorder, to the new order. This is the effect of a crystal parti- 
cle, or seed, which invariably puts an end to a supercooled state- 
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the separation of fast-moving particles 
from the surface of a liquid. Two conclusions immediately follow 
from this, namely, vaporisation increases with increasing tempera- 
ture and requires the application of heat. If the vaporised molecules 
are continuously removed from the surface of the liquid, the vapor- 
isation process continues until all of the liquid has been trans- 


formed into vapour. 
Let us consider vaporisa 
only do molecules separate 


Vaporisation consists in. 


tion in a closed vessel. In such a case, not. 
from the surface of a liquid, but the re- 
verse process also occurs, namely, vapour molecules return to the 
liquid. The vaporisation process will continue until dynamic equi- 
librium corresponding to the given temperature has been estab- 
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lished. Of course, the liquid may completely vaporise without equi- 
librium being established with the vapour. 

When equilibrium exists, we say that a vapour is saturated. The 
pressure of a saturated vapour is a function of the temperature and 
is given by the phase equilibrium curve. By changing the tempera- 
ture, we either vaporise more of the liquid in a vessel or condense some 
of the vapour. This results in a 
change of the vapour pressure. 

It is clear why the density and 
pressure of a saturated vapour in- 
crease with temperature. The num- 
ber of molecules leaving a liquid 
rapidly increases as the kinetic ener- 
gy of the molecules increases. On the 
other hand, the number of vapour 
molecules returning to a liquid is 
almost independent of the tempera- 
ture since such a process requires no 
energy. 

The density of saturated vapour 
at a given temperature varies from 
substance to substance within a broad 
range. At room temperature, the 
density of saturated steam is equal 
to 13 mm, while that of saturated 
l mercury vapour is only 0.005 mm. 
Fig. 281 A clear picture of transition 


be obtained b Ain processes from gases to liquids may 
y considering isothermic compression of a gas, i.e., 


“vertical” processes in a phase diagram. In order to represent volu- 
metric changes, which are not shown in a phase diagram, let us draw 
an auxiliary diagram in which pressure is plotted as a function 
of volume (see Fig. 281). 

If gas compression occurs at a sufficiently low temperature, sooner 
or later we arrive at an intersection point with a phase equilibrium 
curve. At this instant, the pressure is equal to that of saturated 
vapour at the temperature of the experiment and the first drops of 
liquid appear. As long as a vapour is not completely transformed into 
liquid, the compressing motion of the piston will not be accompa- 
nied by a change in pressure since we remain at the same point in the 
phase diagram from the beginning to the end of condensation. The 
condensation process will be indicated by a horizontal line on the 
pressure-volume curve. 

‘The significance of points of a rectilinear segment on the diagram 
is clear. They describe a two-phase liquid-vapour system. Hach 
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point of such a segment corresponds to a definite ratio between the 
phases, which can be easily determined by means of a “lever” rule. 
Let us designate the volume of the liquid by v;, the volume of the 
vapour by vz, and the proportion of substance in the liquid state 
by x. Then, the volume of the wet mixture is given by 


v= zv; + (1— T) vo. 
Hence, 
pa tae 
ve—vy 
This is the “lever” rule. 

When the condensation process is completed, a steep rise occurs 
in the curve since liquids have very low compressibility. 

Now, let us increase the temperature to that of the next isotherm 
on the diagram. It will be practically the same as the first, except 
for one important difference, namely, condensation begins later 
since the pressure of a saturated vapour is greater at a higher tempera- 
ture. Moreover, condensation is completed earlier since the piston 
does not reach its preceding position owing to the thermal expan- 
sion of the liquid. 

By increasing the temperature further, we obtain a series of iso- 
therms in which the horizontal segments corresponding to two-phase 
systems become shorter and shorter. Finally, this segment disap- 
pears entirely. The decrease in the length of the horizontal segment 
indicates that the specific volume of the liquid is approaching that 
of the vapour. At a certain critical temperature, these volumes be- 
come equal and the isotherm no longer has a horizontal portion. In 
the pressure-volume diagram, the critical point is easily determined 
as the apex of the dotted two-phase region. In a diagram of state, 
the critical point is located where the liquid-vapour phase equilib- 
rium curve breaks off. 

As the temperature is incre 
curves less and less and gradu 


ideal gas. À oh 
The existence of a critical point indicates that we were justified 


in stating that there is no basic difference between a gas and a liquid. 
We see that if we by-pass the critical point a transition from a liquid 
state to a gas state can be achieved without going through a phase 


transformation. 


ased, the isotherms resemble broken 
ally approach the hyperbolas of an 
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ain a liquid by compressing a substance the temper- 
above the critical point. When such a substance is 
ecomes very dense, with its molecules coming 
another. Nevertheless, a liquid in the 


We cannot obt 
ature of which is ab 
highly compressed, it b 
into close. contact with one 
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usual sense of the term cannot be obtained. The substance cannot be 
_ poured into a glass like an ordinary liquid, i.e., its state has no dis- 

. tinctive form. This is due to the fact that the phase equilibrium 
curve was not crossed during compression. The absence of such a cross- 
ing indicates that a two-phase liquid-gas system cannot be obtained. 
This means that we cannot get a liquid with a definite form; instead, 
the liquid fills the entire volume available to it. 

Compression must take place at temperatures lying below the 
critical point if a gas is to be liquefied. This is easily achieved if the 
required temperatures are reached by thermal exchange with cold 
bodies. However, in the case of oxygen, nitrogen and hydrogen, 
this is not possible since the critical temperatures are very low. In 
order to liquefy these gases, we must resort to Joule-Thomson or 
adiabatic cooling. 

In the former case, the gas is compressed by means of a compressor 
and passed through a refrigerator. Then, the gas enters a spiral 
tube and is allowed to escape through an aperture, which serves as 
the porous plug (partition) in the Joule-Thomson experiment (see 
p. 175), to a region of lower pressure (atmospheric). Upon expanding, 
the gas cools, rises to the top, and cools the spiral tube. Thus, each 
successive portion of escaping gas will be colder than the preceding 
one. Finally, a temperature is reached at which the gas is trans- 
formed into liquid. 

The other method of liquefying gas involves the use of an ex- 
pander. In a reciprocating expander, a gas is expanded adiabatically, 
performing work in moving a piston, and leaves the cylinder at a low- 
er temperature. By having each portion of gas cool the subsequent 
one, one can reduce the temperature to —450°C. Further cooling is 
impeded by the absence of suitable lubricants to maintain the fric- 
tion between the piston and the cylinder walls at a low level. A solu- 
tion to this problem was found by P. L. Kapitsa, who developed 
a refrigerating turbine—a turboexpander. A turbine is rotated by 
means of the gas from a compressor. The gas expands adiabatically, 
is cooled, and cools the subsequent portion of gas. Difficulties of 
lubrication are overcome by placing the bearings, requiring Iubri- 
cation, external to the cold region. i 
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When it is said that a substance “vaporises”, the reference is 
usually made to the vaporisation of a liquid. The vaporisation of 
solids is called sublimation. One of the most familiar examples of 
evaporation of a solid is the sublimation of naphthalene.. 

Every odorous solid sublimates to a significant extent. The odour 
is produced by the molecules separating from the substance and 


z 
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reaching our olfactory organs. Usually, however, a substance will 
sublimate to an insignificant extent. Sometimes, sublimation may 
not be detected even by very careful investigation. But, in prin- 
ciple, all solids, including iron and copper, vaporise. If sublimation 
is not detected it simply means that the density of the saturated 
vapour is extremely low. That this should be so is quite natural. The 
motion of atoms and molecules in a solid is very ordered and there 
is little probability that a molecule will separate from the surface 
of the solid. 

The density of a saturated vapour in equilibrium with a solid 
increases with increasing temperature. It can be shown that a number 
of substances having a strong odour at room temperature do not 
manifest it at a reduced temperature. In most cases, the density of 
the saturated vapour of a solid cannot be increased significantly 
simply because the substance melts first. 

Vapours are frequently used to obtain crystals when the latter are 
required in very pure form. This can be accomplished, for example, 
by precipitation on slightly cooled glass. 
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The transition from a liquid to a solid state (crystallisation) and 
the reverse transition (fusion) involve a fundamental rearrangement 
of particles. Upon fusion, long-range order in the arrangement of 
molecules or atoms disappears. 

For a given pressure, fusion occurs at a very definite temperature. 
The vibrations of molecules or atoms become so intense that the 
maintenance of long-range order becomes impossible. 

If the temperature is maintained at the fusion point, a liquid and 
a crystal may remain in a state of equilibrium, like in the case of 
a liquid and a saturated vapour. Crystals will neither grow nor melt. 
External pressure changes the fusion temperature. Asa rule, the 
fusion temperature increases with pressure, i.e., fusion becomes 
more difficult. However, there are several exceptions to this rule. 
One of these is ice. The melting of ice is facilitated by an increase 
in pressure. In terms of a phase diagram, we can briefly describe the 
normal and anomalous behaviour of bodies as follows: Usually, 


> 0, i.e., the equilibrium curve forms an acute angle with the 


d 
ture axis. In the anomalous case, T< 0 and the curve forms 


an obtuse angle with the abscissa. The strange behaviour of ice is 
related to another anomaly, namely, ice is lighter than water. The 
relationship of these two anomalies is shown in the following equa- 


. The overwhelming 


tempera 


j ; oD 
tion, which was derived above: Gp = T0,—vð 
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majority of solids are denser than their liquids. Evidently, under 
such conditions, a pressure which produces packing should facili- 
tate fusion. The relationship between the two anomalies is quite 
natural. Consider a liquid and acrystalin a state of phase equilib- 
rium. Let us raise the pressure without changing the temperature. 
The atoms should approach one another. If the solid is denser, the 
liquid is transformed into a crystal state. If the liquid is denser, the 
reverse transition occurs. 

The anomalies of water play an extremely important role in our 
lives. If they did not exist, rivers would freeze at the bottom. The 
anomalies of ice and other such bodies are due to the structures of 
these bodies. Ice crystals do not conform to the law of compact 
packing of particles. Thus, a disturbance of long-range order results in 
an increase in density rather than a decrease, as is usually the case. 

Let us return to Fig. 264 (see p. 603). The broad ice channels may 
displace molecules of ice by expanding somewhat. When ice melts, 
molecules may “fall” into this channel. In such a case, of course, the 
density will increase. No theory exists by means of which the heat 
of melting or the melting temperature could be predicted. This is 
due to the dependence on a great many structural factors. To be 
sure, the heat of melting is easier to predict, since the melting 
temperature is equal to the heat of melting divided by the entropy 
of melting. 

A measure of the binding forces between molecules or atoms is, 
of course, the heat of sublimation (the energy required to break the 
intermolecular bonds) rather than the heat of melting (the en- 
ergy required to eliminate long-range order). 

As indicated above, fusion of crystals cannot be retarded. On the 
other hand, crystallisation can be retarded and, in fact, sometimes 
will not occur at all. In order for crystallisation to begin in a liq- 
uid, there must appear a nucleus, i.e., a system consisting of several 
scores of atoms or molecules which have assumed an arrangement 
corresponding to that of a crystal of the substance. Moreover, condi- 
tions in the liquid must be favourable for the growth of this nucleus. 
In most liquids, it is difficult to achieve a significant retardation in 
the process of formation of nuclei. Cooling under strict conditions 
is required to achieve such retardation. Dust particles must be pre- 
vented from falling into the liquid and all mechanical disturbances 
such as vibrations and the jarring of the vessel containing the liquid 
must be avoided. 

At sufficiently great supersaturation, it is probably impossible to 
avoid the spontaneous formation of nuclei, i.e.,-to avoid the stabi- 
lisation requisite for the crystallisation of atomic or molecular 
groups. But something else may occur. When the temperature is de- 
creased, the mobility of the particles may decrease to such an extent 
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that the rate of growth of crystalline nuclei approaches zero. That 
is how glass is formed. Crystals will invariably grow when a few. 
crystal particles (seeds) are to be found in a liquid under conditions 
of thermodynamic equilibrium with a crystal phase. Crystals for 
industrial purposes are grown by means of seeding. 

If heat is removed very slowly, i.e., the temperature decreases 
a fraction of a degree per day, and if the crystalline particle turns in 
the liquid, the crystal grows only on a few of the faces possible. The 
growth occurs, of course, on those faces having the least surface ener- 


Fig. 282 


of such faces are always prime numbers. Most fre- 


gy. The indexes 
urface density of atoms or mole- 


quently, faces with the highest s 
cules are formed. It is difficult to determine beforehand which faces 


will grow, especially since this depends on numerous subsidiary 
circumstances. Moreover, we shall not consider the growth of crys- 
tals from solutions having certain peculiar features. However, it 
may be asserted that a crystal in equilibrium with a melt (or a solu- 
tion) will assume a form such that its surface energy is a minimum. 

The mechanism of crystal growth consists in each successive par- 
ticle attaching itself to the crystal at a point where the binding 
forces are a maximum and, therefore, the potential energy a mini- 
mum. Fig. 282 shows three possible phases at which an atom can 
become attached to a growing crystal. At A the attracting forces 
acting on an atom are greater than at B, and at B greater than at C. 
Thus, a molecule or atom will invariably attach itself more easily 
to a layer already partially formed rather than begin to form a new 


layer. 
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According to calculations, in certain cases the initial formation 
-of a new layer is bound up with such difficulties that the very growth 
-of a crystal becomes incomprehensible. In such a case, the spiral 
mechanism of explaining growth is most applicable. It is evident 
from Fig. 272 that spiral growth can continue indefinitely and new 
atoms and molecules continually become attached to points which 


Fig. 283 


are favourable from the energy viewpoint. Thus, it is not necessary 
for growth to take place through the formation of a new layer. The 
beginning of spiral growth takes place with the formation of a fault 
known as a spiral dislocation. Such a “fault” is usually due to a tiny 
foreign inclusion. A portion of the surface of a crystal which has 
developed spirally is shown in Fig. 283. 
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A transformation in the solid phase consists in a transition from 
one long-range order to another. The mechanism of such transfor- 
mations is of great interest. i 

The simplest picture for the transformation of one solid phase into 
another in the case of simple substances is obtained when the struc- 
ture of both phases constitute compact packings of spheres. Thus, 
cobalt and thalium are encountered in the form of cubic as well as 
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hexagonal packings. By shifting a layer, we can transfer it from an 
“hexagonal” to a “cubic” state, and vice versa. 

It even has been possible to grow a single crystal of a hexagonal 
phase from a single crystal of a cubic phase by means of transforma- 
lions of this kind. Usually, this is not possible since the growth of 

. crystals ‘of a new phase begins simultaneously from many centres 
and a monocrystal becomes transformed into a fine crystalline sub- 
stance. In most cases, a crystal crumbles when it is transformed into 
another solid phase. Sometimes the 
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outer “shell” of a polyhedral mono- 
crystal is preserved and a fine crys- 
talline substance occupies this perfectly 
symmetrical volume. 

The reason for the difficulty is 
clear. Crystals of a new phase may a 
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‘Fig. 284 


begin to grow from various points. But close layers in a cubic face- 
centred lattice may be formed by four different systems. Let us 
return to Fig. 264 (p. 600). In the crystal shown in this figure, close 
planes are perpendicular to the spatial diagonals, of which there are 
four in a cube (eight corners). Thus, hexagonal crystals of four differ- 
ent orientations may grow from a crystal with a cubic packing arran- 
gement. G. V. Kurdyumov’s work, devoted to transformations of 
iron and steel, laid the basis for the study of the rearrangement of 
atoms in phase transformations. At high temperatures, iron exists 
in the form of a compact cubic packing of atoms. At low tempera- 
tures, the iron atoms become arranged in a body-centred lattice. 
This transformation, known as a Martin transformation, is of tre- 
mendous importance in metallurgy* and, therefore, should be con- 
sidered in greater detail. ad 

In Fig. 284, we see what occurs when the temperature is increased. 
The left diagram again shows a compact cubic packing arrangement; 


* The hardening of steel is nothing more than a Martin transformation. 
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the right diagram shows a body-centred packing arrangement drawn 
in a rather unusual form: we see the projection of an arrangement of 
atoms as it appears when viewed along a plane diagonal of a cube. 
It would seem that these two diagrams have little in common, the 
main difference being that the left diagram represents a three-slo- 
reyed structure and the right a two-storeyed structure (the trian- 
gles are in the second storey). Less important is the difference in the 
angles of the rhombuses (not shown in the figure). As the temperature 
is increased, the atomic vibrations increase and the less compact 

- body-centred lattice becomes less advantageous at a temperature 
of 906°C. The two-storeyed structure becomes transformed into the 
three-storeyed one by the alternate shifting of the layers marked by 
triangles. For example, the odd layers shift to the left and the even 
ones to the right. This shift occurs along the diagonal of a rhombus; 
the angle of the rhombus changes at the same time. 

When a phase transformation occurs in iron, crystal particles of 
the new phase may become oriented in any of 24 different directions. 
The number 24 is obtained in the following manner. There are four 
close planes in a cubic face-centred crystal and, as can be easily 
shown, a crystal of the new phase grows in six different directions 
in a close layer. 

Undoubtedly ordered, regular processes play an important role in 
the transition from one order to another. In such a rearrangement 
of order, atoms do not have to interchange places, i.e., only an organ- 
ised shifting of atoms occurs. This is what takes place in a Martin 
transformation, a transformation which does not involve diffusion. 
However, in other transformations in a solid, diffusion phenomena 
may play an important role. 
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It has long been known that foreign atoms diffuse in a solid. The 
surface layer of steel may be impregnated with carbon (cementation), 
nitrogen or boron. Diffusion occurs to a great depth and it is not par- 
ticularly difficult to follow the process. At a temperature of 200-300°C, 
significant quantities of silver penetrate lead to a depth of several 
centimetres in an hour. 

However, not only foreign atoms migrate in a crystal. An iron atom 
can migrate in a crystal of iron and a copper atom in a crystal of 
copper. If a piece of radioactive copper is pressed against an ordi- 
nary piece of copper, the latter will soon become “contaminated” 
(radioactive). By means of tagged atoms, one can study the diffusion 
of atoms of the same kind, as well as of “foreign” atoms. 

Diffusion is possible as a result of thermal vibrations. When an 
atom leaves its equilibrium position, a neighbour takes its place. 
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Upon returning, the atom occupies the vacated spot. Thus, atoms 
interchange positions. Such an interchange is, of course, not easily 
achieved if only two atoms participate in the process. When two 
atoms interchange positions in a solid, a whole group of atoms are 
involved. An atom slips forward only when the thermal vibrations of 
many atoms accidentally create favourable conditions for this. 

Dislocations, empty spaces and fractures, which always exist in 
a crystal, play an important role in diffusion. The presence of an 
emply space in a crystal facilitates the step-by-step migration of 
an atom through the lattice. An atom hindering this migration is 
“pushed” into the empty space. 

If a foreign atom is not very large, it may move through the lat- 
tice without interchanging positions with lattice atoms. When the 
conditions become favourable, such an atom slides from one empty 
space in the compact packing of spheres to the next. 

Diffusion is a two-way effect. If a zinc plate is pressed against 
a copper one, zinc atoms will penetrate the copper and copper atoms. 
will penetrate the zinc. To be sure, the rates of flow in the two direc- 
tions may vary considerably. 

The diffusion of atoms through a crystal depends on many factors. 
It is interesting that a diffusion process proceeds most rapidly 
when the foreign atoms differ in all respects from the atoms of the 
crystal through which they moye. Diffusion proceeds mest slowly 
for atoms which are the same as those of the crystal or in the same 
column of the Mendeleyev periodic table as those of the crystal. 

As already indicated, the presence of fractures and dislocations 
facilitate diffusion. Therefore, diffusion proceeds most rapidly in 
a deformed metal. The rate of diffusion is greatly dependent on tem- 
perature. This is not surprising since the diffusion coefficient, the 
coefficient of proportionality between flow of matter and concentra- 
tion gradient, can always be represented by an expression of the form 


Ey 
Ae-UAT, 


where U is the height of the potential barrier which an atom must 
surmount in an elementary diffusion event. The existence of such 
a relationship is rather evident since the diffusion coefficient must 
be proportional to the number of atoms the energy of which suffices 
to cross the potential barrier. 

Such barriers are quite high. In the case of self-diffusion, they are 
usually about 1-2 ev. It will be recalled that kT at room temperature 
is equal to ~0.03 ev. The number of atoms with energies much 
greater than the average is very small; hence, practically no diffusion. 
occurs. At a temperature of the order of 1,000°C, the situation is 
entirely different. 
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In the preceding article, we discussed crystal-crystal transforma- 
tions occurring in an organised manner without diffusion. It should 
not be assumed that all phase transitions occur this way. On the 
contrary, at a sufficiently high temperature, the interchange of posi- 
tions by atoms begins to play a very important role and the organ- 

_ ised nature of transitions will be achieved in only small regions, or 
may even be completely obscured by the interchange of posilions 
by atoms. 

If a solid state transformation is of a diffusive nature, it proceeds 
at a rate that is comparable to that of self-diffusion processes. The 
heights of potential barriers surmounted by atoms during rearrange- 
ment are of the same order as during self-diffusion processes. 

In the case of organised displacement of atoms of the Martin 
transformation type, the transformation proceeds at a ten-fold 
rate at low temperatures. 


CHAPTER XXXIV 


DEFORMATIONS OF BODIES 


253. Elastic Properties 


For every solid, there exists a distorting force limit up to which 
a deformation is elastic. This means that if the elastic limit is not 
exceeded, the body returns to its original state. 

Elastic deformations, like other deformations, are associated with 
the displacement of atoms (or molecules). When a body is extended 
elastically the interatomic spacing increases and when it is com- 
pressed the interatomic spacing decreases. 

The distinctive feature of elastic deformations is that they do 
not destroy interatomic bonds or create new ones. 

When a crystal is deformed elastically, all of the atoms continue 
to have the same neighbours. Thus, in elastic displacement, the lat- 
tice of a crystal as a whole becomes deformed (sloped). Hence, each 
atom continues to have the same neighbours. This enables the body 
to return to its equilibrium state when the distorting force is removed. 

The change in interatomic spacing that may be achieved by means 
of elastic extension or compression is quite small. The maximum rel- 
ative elongation of an elastic nature does not, as a rule, exceed 0.001. 
This means that for interatomic spacings of the order of 2A the 
equilibrium, positions of the atoms may be displaced by no more 
than 0.002 A. Such small changes in a lattice period can be detected 
through X-ray analysis by observing the displacement of diffraction 
lines on a roentgenogram. This requires that the lines be filmed at 
large 0 angles, since only in this manner can small changes in inter- 
planar spacings be detected (see p. 391). 

The elastic deformation of polymers such as rubber is of an entire- 
ly different nature. Rubber has mechanical properties which basi- 
cally differ from those of crystalline substances. The fundamental 
difference lies in the magnitude of elastic elongation. Certain kinds 
of rubber may be stretched to 10-15 times their normal length without 
exceeding the elastic limit. Thus, they may be elongated 10,000 
times more than metals! The magnitude of the modulus of elasticity 
of rubber is no less striking. 

A steel wire having a cross-section of 1 mm? will stretch one twen- 
ty-thousandth of its length under the action of a load of 1 kg, but 
a rubber band having the same cross-section will stretch to twice 
its original length under the action of such a load. 
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Two processes occur when polymers are stretched. First, tangled 
bundles of molecules become disentangled. At the same time, there 
occurs regular packing of certain portions of the disentangled bun- 
dles of molecular chains into a three-dimensional order. Evidently, 
the fact that crystallisation takes place upon stretching is of second- 
ary importance, since the established order does not remain when 
the external force is removed, i.e., the bundles of molecules become 
twisted once again. 

The twisting of bundles of molecules is accompanied by an increase 
in entropy, i.e., an increase in the degree of disorder. It turns 
out that the internal energy of rubber and similar polymers practi- 
cally does not change when such substances are elastically deformed. 
Therefore, the work of stretching, which according to the fundamen- 
tal laws of thermodynamics is dA = dU — TdS, in this case simply 
equals —TdS, i.e., it is directly proportional to temperature. (it 
should be recalled that the work of external forces on a system is 
considered to be negative.) In this respect, the elastic deformation 
of rubber is of the same nature as the isothermal compression of 
a gas (cf. p. 178). 

‘At should be noted that both the elastic deformation of a crystal 
and that of rubber do not yield a new potential energy minimum. 


In the case of a crystal, this is due to the fact that the energy does 
not change at all. 


254. Plastic Properties 


Slippage. The elastic deformation of a crystal consists in the chang- 
ing of its interatomic spacing with each atom maintaining its same 
neighbours. On the other hand, in the case of a plastic deforma- 
tion—a deformation which remains when the external force pro- 
ducing it is removed—atoms surmount their potential barriers and 
enter new “potential wells”, i.e., change their neighbours. The basic 
mechanism of plastic deformation is the slippage of one atomic plane 
relative to another. An element of such slippage consists in the dis- 
placement of all of the atoms by one period. This can be detected 
with the naked eye in the form of so-called slip bands. Slippage 
occurs at the weakest points (fractures and other defects) and the 
crystal breaks up into layers (slip stacks). The plotting of stack 
thicknesses, the order of magnitude of which is equal to several 
tenths of a micron, yields a random distribution curve. The forces 
required to displace atomic planes having different indexes will 
differ. Usually, it is easiest to displace the planes which are most 
compactly filled with atoms. However, slip planes of crystals may 
change with changes in temperature and impurities and also during 
the deformation process itself. In aluminium, the plane (414) is 
a slip plane. 
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Slippage occurs along a given plane having a definite orientation. 
Usually this plane has the densest distribution of atoms (e.g., [101] 
in an all-sided face-centred cubic lattice). 

In order for displacement to begin, a certain minimum stress 
(ultimate shearing stress) is required. The magnitude of this stress 
is very small, reaching several grams per square millimetre in some 
cases. In measuring the ultimate stress, one must, of course, take into 
account the orientation of the slip plane relative to the external 
force. 

Strength. A monocrystal of zinc can be easily bent by hand. How- 
ever, it is not possible to straighten it in the same manner. This 
is due to the fact that its strength has increased. 

The resistance of a crystal to displacement increases with increas- 
ing deformation. Therefore, plastic displacements along a given slip 
plane do not cause the material to rupture, but rather cease when the 
strength is sufficient to oppose the external force, and thereupon dis- 
placement begins in other planes. Thus, the number of slip bands 
increases and the slip stack thicknesses decrease. 

If an external force is applied to a crystal previously subjected 
to plastic deformation, such deformation will resume, of course, when 
the magnitude of the force reaches the value at which it previously 
ceased to be effective owing to an increase in strength. It can be stat- 
ed, therefore, that an increase in the strength of a crystal increases 
its elastic limit—and, moreover, by a large factor. 

One theory relates the described increase in the strength of a crys- 
tal to a disturbance of the regularity, i.e., distortion, of its lattice. 
From this viewpoint, it is quite natural that the strength of a crystal 
should increase with increasing rate of deformation and decrease 
with increasing temperature. However, from the viewpoint of the 
dislocation theory discussed above, the picture is different. 

Plastic Deformation as a Displacement of Dislocations. Let us 
consider in greater detail the process of displacing one atomic plane 
relative to another. If there are no dislocations in a slip band, it is 
necessary to shift every row of atoms in the displacement plane. The 
situation is quite different when a displacing force acts on a crystal 
containing dislocations. 

Fig. 285 shows a compact packing of spheres which contains 
a simple dislocation (only the end spheres of the rows are shown). 
For simplicity, let us assume that the dislocation region embraces 
a minimum number of rows. Then, the dislocation consists basically 
in the following: Between two rows of the upper, extended layer, 
adjoining the boundary between blocks, there is a linear gap. In the 
lower, compressed layer on the other side of the boundary between 
blocks, there is an extra row of atoms (the two rows of atoms just 
below the linear gap are very compressed). Now, let us begin to dis- 
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place the upper block to the right relative to the lower one. At 
a certain initial instant, the “fissure” is between rows 2 and 3; rows 
2’ and 3’ are compressed. As soon as the force becomes effective. 
row 2 moves into the “fissure”, the shape of sphere 3’ is restored, 
and sphere /’ becomes compressed. The entire dislocation has 
shifted to the left, and it will continue to move in this direction 
until it is “pushed out” of the crystal. In other words, displacement 


Fig. 285 


consists in shifting the dislocation line along the displacement plane. 
It is clear that a much smaller force is required to achieve dis- 
placement when dislocations are present. 

According to calculations, the strength of a crystal in which there 
are no dislocations should be a hundred times as great as the strength 
of an actual crystal, determined experimentally. The presence of 
a small number of dislocations suffices to decrease the strength to 
a small fraction of that of an ideal crystal. 

Fig. 285 shows how a dislocation is “pushed out” of a crystal by 
an applied force. Thus, as the degree of deformation is increased, the 
strength of a crystal increases. When the last dislocation of a crystal 
is eliminated, its strength will be about a hundred times as great 
as that of a perfectly normal crystal. In this manner, an increase in 
strength can be easily explained. To be sure, in order to obtain quan- 
titative agreement between calculations and experimental results, we 
must assume that special as well as ordinary dislocations may aid 
displacement. 

Excellent confirmation of this theory is provided by the fact that 
the strength of perfect crystals which are grown artificially is approx- 
ximately equal to the calculated value for an ideal crystal. 
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255. Ultimate Strength 


As the stress within a body increases, its deformation increases 
up to a certain point (the elastic limit). Then, plastic deformation 
begins and because of an increase in strength the curve rises sharp- 
ly. Finally, the rupture point is reached. 

The ultimate strength of a body depends, to a certain extent, on 
the duration of an applied force. When the force is of prolonged dura- 
tion, the ultimate strength drops. The values given in engineering 
handbooks generally refer to short-duration tests. The dependence 
of the ultimate strength of a body on the duration of an applied 
force indicates that in slow processes the behaviour of a solid depends 
on internal diffusion processes. It may be assumed, for example, 
that even small forces can create a favourable trend for processes 
involving an interchange of positions by atoms. 

The value calculated for the ultimate strength ofa perfect crys- 
tal is several hundred times as great as measured values. This sig- 
nifies that faults play a fundamental role in a crystal. The calculat- 
ed value for the rupture strength of a monocrystal of rock salt is 
200 kg/mm?. Under ordinary conditions, a rod of this substance will 
rupture when it is subjected to a load of 0.5 kg/mm?. 

The role played by fractures was demonstrated in the well-known 
experiments of A. F. Yoffe, who investigated the rupture of rods of 
rock salt in water. Water dissolves the surface of such a rod and 
“heals” the microfractures formed during extension. As a result, 
the resistance of rock salt to rupture becomes greater than 100 kg/mm*, 
i.e., it approaches the theoretical value. 

The presence of faults serves to decrease the effective area at which 
rupture occurs. In the final analysis, the force acting at the instant 
of rupture is determined by the number of broken interatomic bonds. 

The strength of various materials determined in this manner vary 
within narrow limits. Thus, a piece of thread or rubber band rup- 
tures at'a stress of the same order of magnitude as that of a steel wire. 


256. Mechanical Properties of a Polycrystalline Material 


al, elastic stage of deformation, the grains of 
al are not deformed uniformly since they have 
various orientations relative to the line of action of the force. As 
a result, the elastic properties of a polycrystalline substance will 
differ from those of a monocrystal. : 

However, in the plastic region, the behaviour of a polycrystalline 
substance may differ from the behaviour of a monocrystal of the same 
substance to an even greater extent. A polycrystalline substance 
offers greater resistance to an external force. This is not surprising 
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since the development of plastic displacements in a given grain will 
be impeded by neighbouring grains in which slip planes are orient- 
ed entirely differently relative to the applied force. 

Moreover, in a polycrystalline material, there occurs an entirely 
new phenomenon: the turning of grains and the formation of tex- 
ture. The turning of grains in drawing, rolling and other deformation 
processes is determined by the tendency of each grain to become 
aligned in such a manner that slippage is facilitated, i.e., with its 
slip plane parallel to the applied force. If there are several slip planes, 
the grain assumes a position for which the effect of the slip planes 
is maximal. For example, in most metals having cubic all-sided face- 
centred cells, the grains tend to become arranged with the [41414] 
directions parallel to the axis along which the material is drawn. 

Another new phenomenon occurring in such a material consists 
in the slippage of the grains relative to one another along an inter- 
crystal layer. Such displacement differs from crystal displacement 
and is more like viscous flow in thick liquids. 

A polycrystalline material may not rupture at the same value of 
stress as a monocrystal. In certain cases, the grains of a polycrystal- 
line material remain whole when the material ruptures. This occurs 
when intercrystal layers have weak mechanical properties. As a gener- 


al rule, the strength of a material increases with decreasing grain 
size. 


257. The Effect of Surface-Active Substances on Deformation 


Every solid has numerous ultramicroscopic structural faults, 
arising as the result of the thermal mobility of its atoms and the 
presence of impurities and mechanical defects. These faults are dis- 
tributed throughout the volume of ‘a solid and in many cases can be 
viewed as extremely minute embryonic microfissures. The basic 
characteristic of such embryonic microfissures is their ability to in- 
crease in size during the process of deformation of the material. 

Under the action of external forces, the size of these microfissures 
increases and a concentration of stress arises at their edges. This, 
in turn, further facilitates the growth of the microfissures. In the 
case of a brittle material, such an increase in the size of microfissures 
during a deformation process may result in premature rupture of 
the material. In the case of a plastic material (most metals) such an 
increase in the size of microfissures during a deformation process 
results in the formation of plastic displacements. When a body is 
in a three-dimensional stressed state, the microfissures have wedge- 
shaped cross-sections and are characterised by exposed surfaces— 
orifices and cul-de-sacs in which the fissures preserve their embry- 
onic nature. Actual fissures terminate in cul-de-sacs like a sharp 
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blade having a very large curvature, with a radius of curvature of 
the order of magnitude of the lattice spacing. When the deforming 
forces are removed—in the range of elastic deformations—the micro- 
fissures gradually “heal” in reverse order, i.e., first the cul-de-sacs 
close and then the orifices. 

P. A. Rebinder has shown that the effect of the surrounding medi- 
um of a solid on its mechanical properties is not restricted to such 
chemical action as corrosion and dissolution. The exposed surface of 
a solid is always coated with a thin film of a component of the sur- 
rounding medium which is most closely related to the given solid. 
Such a substance may be a gas or vapour usually contained in air, or 
a substance located in the vicinity of the solid. The molecules of 
a substance clinging to the surface of a solid, or, as we generally 
say, adsorbed by a solid, are able to move along this surface and 
migrate from a region where there is an excess of such molecules to 
a region where there is a deficiency for complete coating of the sur- 
face. The tendency of an adsorbed layer to occupy all of the surface 
available to it is due to the fact that adsorption decreases the surface 
energy of a solid. Substances which may be adsorbed by the surface 
of a solid are called surface-active substances. Various organic alco- 
hols, acids, and salts of these acids, i.e., soaps, are highly surface- 
active substances with respect to metals. 

The strength of a solid decreases when it adsorbs a surface-active 
substance. If a solid is ruptured in a medium containing even a small 
amount of a surface-active substance (for example, in a solution of 
oleinic acid in pure vaseline oil), the force required to rupture the 
material is less than under ordinary conditions. This is particularly 
evident in the crushing of soft rocks and in the subjection of metals 
to a variable force or a force of prolonged duration. 

The effect of adsorbed molecules on the strength of a solid can be 
explained as follows. When molecules are adsorbed by the surface of 
a body, they penetrate the microfissures as a consequence of their 
mobility and tendency to occupy all of the exposed surface of the 
adsorbent. The drawing of adsorbed layers into a microfissure is due 
to the decrease in the surface energy of a solid caused by such penetra- 
tion. If an obstacle is placed in the path of an adsorption layer tend- 
ing to occupy a surface area of a solid which is not yet occupied, 
the adsorption layer will exert pressure on the obstacle. Within 
a microfissure, such an obstacle is provided by the molecules them- 
selves, i.e., their size prevents them from penetrating deeper into 
Therefore, at the boundary of an adsorbed layer 
within a microfissure, a, pressure, which is directed so as to increase 
the size of the fissure in depth, arises. Adsorbed layers behave like 
wedges driven into the microfissures. Thus the penetration of adsorb- 
ing molecules into the orifices of microfissures tends to create 
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additional disrupting forces. This is equivalent to increasing the 
external deforming forces. Therefore, the rupture of a solid in the 
presence of adsorbing substances is brought about by lower ap- 
plied forces. 


258. Material Breakdown Under the Action of a Stream 
of Particles 


The problem of material breakdown under the action of a stream 
of particles is of great importance in the construction of nuclear 
reactors. The materials of a reactor, including nuclear fuel, modera- 
tor, walls and instruments, are subjected to the action of neutrons, 
fission fragments, electrons, etc. Let us consider those actions of 
streams of particles which leave permanent effects. 

In the first place, we should mention particle collisions in which 
an electron providing a chemical bond between atoms is dislodged 
from its position. In such cases, ionisation results in the rupture 
of the bond. This bond is not necessarily re-established. Moreover, 
the ions or radicals which are formed may recombine in a different 
way. Therefore, in molecular materials, ionisation results in the 
breakdown of certain molecules and the creation of new ones. 

An atomic nucleus may also be displaced from. its position. In 
such a case, it drags along its electronic cloud. Therefore, it can be 
said that an entire atom, rather than just a-nucleus, has been dis- 
lodged from its position. Such an effect of the action of radiation is 
almost always irreversible. ; 

Materials are damaged by radiation as a result of the displacement 
of atoms from their positions and the rupture of chemical bonds. 
Atoms are dislodged under the action of heavy charged particles and 
fast neutrons. Chemical bonds are ruptured under the action of slow 
neutrons, y-rays and electrons. 

Let us consider more carefully what occurs when atoms are dis- 
lodged from their positions in solids. The process of displacement 
of atoms is a chain process. This means that the first displaced atom 
displaces another atom which is located in its path; the latter is 
able to displace a third atom, etc. By means of such a chain process, 
a single fast nuclear projectile is able to produce considerable distor- 
tion in the crystal lattice of a solid. The nature of the distortion 
varies considerably from case to case. A crystal lattice may be com- 
pletely destroyed. Foreign atoms may penetrate between atoms ol 
the primary lattice. Also possible are processes involving the sub- 
stitution of atoms of the primary lattice by projectile atoms. The 
number of displacements per charged particle for one element does 
not differ greatly from the number for another element. An g-parti- 
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cle having an energy of 5 Mev, or a proton having an energy of 
20 Mev, dislodges 60-80 particles. 3 

These figures suggest that the heavier the particle, the greater the 
damage. Analogous figures for fission fragments of uranium-235 
or plutonium-239 nuclei are conside rably more unexpected. Such 
a pair of fragments produces, for example, 25,000 displaced atoms in 
uranium and 8,300 displaced atoms in graphite. 

The process of slowing down a neutron from its initial velocity 
to thermal velocity also does not proceed without causing damage 
to materials. The slowing down of a neutron causes 450 atoms to be 
displaced in beryllium, 1,870 atoms in graphite and 6,030 atoms 
in aluminium. 

It is evident from these figures that significant changes in the prop- 
erties of a crystal lattice are to be expected as a result of the dis- 
placement of atoms from their positions in the lattice. Metals are of 
primary interest in ‘this connection. The reason for this is that the 
sole permanent effect of radiation in metals is the displacement 
of atoms. 

The action of neutrons and fission fragments has been studied most 
carefully. This is not surprising since such investigations are of basic 
importance in the design of nuclear reactors. The effect of a dose 
of 10! neutrons per sq cm has been studied in detail. Such a dose is 
not very large. Generally speaking, the materials of a nuclear reactor 
are subjected to such a stream of neutrons during each day of opera- 
tion. However, even in the case of such a small dose, the properties 
of metals undergo important changes. These changes approximate 
those which occur in the cold working of metal. Thus, under the 
action of neutrons and fission fragments a metal’s brittleness and 
hardness increase, its ductility decreases, and its electromagnetic 
properties also change. 

Radiation damage in metals consists mainly in the displacement 
of atoms from their positions, but in organic materials, where the 
atoms are connected by chemical bonds, the changes consist mainly 
in the rupture of such bonds as the result of ionisation. Organic mate- 
rials are very rapidly broken down under -the action of radiation. 
A dose of the order of 10!° neutrons per square centimetre practically 
disintegrates an organic substance. In a reactor, paraffin, olefin and 
polyphenyl sustain a damage of 25 per cent within several hours. 

The transformation of an organic material usually consists in the 
liberation of a gas and in polymerisation. However, it should be noted 
that in certain cases high-polymer materials are depolymerised 
under the action of radiation. - 


CHAPTER XXXV 


DIELECTRICS 


259. The Relationship Between Permittivity 
and the Polarisability of a Molecule 


In a number of cases, particularly in gases, the molecules of 
a substance do not interact with one another. Hence, the electrical 
properties of such a substance are determined by the average behav- 
iour of one of its molecules. Molecules do not interact with one another 
in many dilute solutions as well. Occasionally, molecular inter- 
action plays a secondary role even in a concentrated solution. 

Therefore, consideration of the electrical properties of a substance 
which consists of a large number of noninteracting molecules is of 
considerable importance. ė 

The dipole moment of a unit volume of dielectric, P, is determined - 


by the permittivity ¢ and the field intensity Æ in accordance with 
the formula > 


peas E 


4n 


(see p. 256). On the other hand, the polarisation vector P is equal to 
the sum of the dipole moments in a unit volume of dielectric: 


P=} p 
or 


P=Np, 


where NV is the number of molecules in such a unit volume and p 
is the “contribution” of each molecule to the polarisation vector. 
If Æ’ is the field intensity which acts on a molecule, then 


p= BE’, 
where f is the polarisability of the molecule. 

It would seem that the relationship between 6 and e should now 
be given by the expression e = 1 + 4aNB. However, this is not $0; 
and that is why the field intensity in the above formula was denoted 
by Æ prime. The equations relating P and E and p and E’ involve 
different field intensities. Æ is the force acting on a unit test charge; 
such a charge does not distort the existing field. Æ’ is the field pro- 
duced by all the molecules (with the exception of the given one) 
acting upon the given molecule. 
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On p. 261, it was indicated that the field inside a dielectric sphere, 
E;, is related to the external field in which this sphere is located 


as follows: 
4 
E;=E.—z aP. 


It is evident that the field in a spherical cavity which is cut in a di- 
electric may be determined by changing the sign of P: 


E,;=Ee+4nP. 


[t may be rigorously proved that Æ’, the field due to all the mole- 
cules of a gas (except the one acted upon), is equivalent to the field 
in a spherical cavity. Thus, 


E'=E+$ nP. 


The relationship between f and s may now be determined. Equat- 
ing the derived expressions for P, we obtain 


e— i1 = ? 
7 E=Np= NE". 


5 À h 4 —1 r 
Now, substituting 2’ = E+ site E, we obtain the so-called 


Clausius-Mossotti formula: 


e—1 An ar 
ep = NB. 


If each member of the equation is multiplied by > where M is 
the molecular weight and p is the density, the resulting expression 
will depend only on the polarisability B. Thus, N =No = 


= 6.02 x 10% (Avogadro's number). 
The quantity 
EAM A 
f= = wt > Nap 


ecular polarisation. To determine the molecular po- 
sure the permittivity of a substance as 
f a condenser filled with the substance 
under investigation to the capacitance of the condenser with the 
dielectric removed. Capacitance is usually measured by means 
of a bridge. Such bridges are constructed for the range 30 cps to 
300,000 cps. However, bridges may be constructed for frequencies 


up to 40 mc/s. 


is called the mol 
larisation, one must first mea: 
the ratio of the capacitance o 


~ 
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A wide variety of permittivity meters (instruments for measur- 
ing e) are employed. Such instruments are very sensitive and enable 
us to measure permittivity with a high degree of accuracy. Indeed, 
excellent results may even be obtained with gases having a pressure 
of the order of 1 mm of Hg. 

Using the formula e = n? (see p. 331), we can derive an index of 
refraction equation which is analogous to the molecular polarisa- 
tion equation: 


2 


R= 


1 M An ar 

n? a2 Dhan Syn arf. 

This characteristic of a molecule is called molecular refraction. 
Measured values of R and & for different frequencies of electromag- 

netic vibrations may differ considerably from one another. 
Despite the fact that the derivation of these formulas presupposes 

a gas, molecular interaction evidently changes matters little. In 

any case, the formulas for R and # are widely used in the investiga- 

tion of dilute solutions as well. 


Examples. Let us consider benzene, Cgly (e=2.28; p=0.88 gm/cm; 
M =78), and water (e =81; p=1 gm/cem#; M—18). Assume that the plates 
of a flat condenser, which creates an electric field £300 v/em=1 Gaus- 
sian unit, are immersed in these liquids. 


1. Let us calculate the polarisation (the electric moment of a unit 
volume of dielectric) of benzene and water: 


e—I 2.28 —1 


Pinze = So = i ranean it: 
benzene = -z E Tada 4-04 Gaussian unit; 
P water =6.4 Gaussian units. 
z A at ip 
The contribution of each molecule to the polarisation vector is Doe? 
pe NADAR 
where N= MA is the number of molecules per unit volume; 


Poenzene = 1.9 10-23 Gaussian unit; 


Puater =19-4X 10-23 Gaussian unit. 


2. Let us calculate Æ’, the intensity of the electric field due to all 
molecules (except the one acted upon): 


Evonzene =E + 3 %Pvenzene= 1-43 Gaussian units; 


Fivater = 27-8 Gaussian units. 


i.e., the field in water is about 28 times (!) as great as the applied field. 
Now, the polarisability of molecules of benzene and water can be deter- 
mined: 


Boen zene =f = 1.05 10-23; Byater =0.7x 10-28, 
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3. From measurements of s by means of a permittivity meter, we can 
e—1 M 
e+2 p` 


calculate the molecular polarisation P = 


Penzene ~ 26.6 Gaussian units; 


P water = 17.3 Gaussian units. 


4. Measurements of the index of refraction n by means of a refracto- 
meter yield rpenzene = 1.5014 and rater = 1-330. Using these values, we can 
i n 1M 
calculate the molecular refraction R= —: 
n2+2 p 


Roenzene= 26.1 Gaussian units; 

Rwater =3-6 Gaussian units. 
It is seen that in the case of benzene P ~R and in the case of water the 
values of P and R differ considerably. The reason for this will be ex- 
plained in the next article. 


260. Polarisation of Polar and Nonpolar Molecules 


Polarisation of a substance under the action of an electric field 
may occur for two reasons. First, the centre of gravity of the electron- 
ie cloud may be displaced (inherent polarisability). Secondly, the 
field has an orienting action which may turn molecules having 
a constant, or rigid, dipole moment closer to the direction of the 
field. Therefore, it is customary to divide polarisability into two 
parts: a—inherent polarisability and b—orientation polarisability. 

A molecule must be turned as a whole in order for the dipole to 
become oriented. Owing to the inertia of a molecule, such turning 
requires a certain amount of time. For rapid electromagnetic vibra- 
tions, a rigid dipole cannot follow the field. Therefore, in the case 
of light waves, the orientation polarisability b is absent. 

Thus, 


Ji 4s 
P= Nav (a+b) and R => N axa. 


The polarisability @ of a molecule can be determined by measuring 
the index of refraction. If, in addition, & is also measured, the orien- 
tation polarisability b is obtained by subtraction. 

The magnitude of the orientation polarisability is directly related 
to the rigid dipole moment p of a molecule. We shall show that 

p> 

bas: ' 

Gas molecules are randomly oriented as a result of chaotic thermal 
motion. In the absence of a field, the assumption of any direction by 
the dipole moment p of a molecule is equally probable. The situa- 
tion changes if a field Æ isapplied. The potential energy of a dipole 
is equal to e (p+ — -)) where p+ and ọ- are the potentials of the 


42—1409 
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field at the ends of the dipole, i.e., 


G 


—e “t l= — pE = — pE cos), 


where 0 is the angle between the field vectors and the dipole moment. 
A dipole oriented in the direction of the field has a minimum energy. 
This energy is equal to—pZ#. Thermal motion prevents all dipoles 
from assuming a position of minimum energy. A certain compromise 
distribution is established between the tendency to maximum entro- 
py and the tendency to minimum energy (see p. 629). The Boltzmann 
law is an expression of this compromise. The probability that the 
energy of a molecule lies between U and U + dU is proportional 
—U 


to eT dU. In our case, U =—pE cos 0. Therefore, dU=pE sinOd0. 
The fraction- of molecules the dipole moments of which lie 


pE a 
between 0 and 0 + d0 will be equal to e*T ios sin 0 dð. 

For ordinary temperatures, pë < kT. Even for extremely strong 
fields of the order of 10° v/em, the ratio 26 will be of the order of 
0.01 (the order of magnitude of dipole moments is 10-18 Gaussian 
unit). Therefore, we can use the approximation e* œ 41 -+ x, and 


the fraction of molecules sought will be equal to 
const (1 + cos 0) sin 0 d0. 


The integral of this expression from 0 to x should equal unity from 
the probability viewpoint, since for any molecule the direction of p 
lies somewhere between 0 and a. Then, as can be easily verified, the 


constant is equal to L and the fraction of molecules the polarisation 


vectors of which lie in the interval O to 0 + dO will be equal to 


1 pE F 
F (1 +r 60s 0) sin 0 d0. 

The projection of the dipole moment on the direction line of the 
field is p cos 0. If N is the number of molecules per unit volume, 
the fraction contributed to the polarisation vector by molecules 
inclined at an angle 0 to the field will be equal to 

4 E a e 

{Np (4 +47 cos 0) sin 0 ċos 0 dé. 
The polarisation vector P can be determined by integrating this 
expression from 0 to x. We obtain 


yy Bp. 
P=N zr E 


OO a 
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hence, the orientation polarisability is given by the formula 


The relationship between molecular polarisation and temperature 
is expressed by the formula 


ag ee 2 
P= Na (a Sr tr) : 


This theoretical conclusion is in excellent agreement with experi- 
mental results. By measuring & as a function of 7, we can easily 
determine the two parameters which describe the electrical proper- 
ties ofa molecule, viz., polarisability and the “rigid” dipole moment p. 

Thus, the values obtained for a by measuring R can be used to 
determine p ‘by substituting in the expression for P. 

Experiments indicate that in certain cases the interaction of di- 
poles of neighbouring particles may result in significant changes in 
permittivity as compared with the value of e for a system of nonin- 
teracting molecules. This can be shown by measuring e in a liquid 
and in a gas formed of the same molecules. 

The interaction of particles also affects the permittivity of crystals. 

As a rule, electric polarisation in crystals occurs only as the result 
of the deformation of electronic clouds and the displacement of ions. 
No orientation polarisation occurs since, by and large, molecules 
cannot turn in a crystal. 

In many ionic crystals, the index of refraction squared is consider- 
ably less than the permittivity. (For example, the values for rock 
salt are 2.37 and 6.3, respectively, for titanium dioxide 7.3 and 144, 
and for lead carbonate 4.34 and 24.) In such crystals, the electronic 
cloud is deformed and, in addition, the ions are displaced as a whole 
under the action of a static field. On the other hand, it has been estab- 
lished that in molecular crystals the permittivity is equal to the 
square of the index of refraction. This indicates that polarisation is 
due exclusively to the deformation of the electronic cloud. 

Since orientation polarisation is absent, permittivity varies very 
little as the temperature changes. . 

It has already been indicated that in the case of a rapidly varying 
field there is no orientation polarisation and the molecular polarisa- 
tion becomes equal to the refraction. It is important to know which 
field oscillations should be considered rapid. This can be determined 
if the relaxation time is known. When the relaxation time t is 
much greater than the oscillation period, there is no orientation 
polarisation. i 

The relaxation time t was discussed on p. 164. If a dielectric is in 
a constant field, its dipoles assume an equilibrium orientation dis- 


42% 


= 
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tribution which depends on the temperature. When the field is 
switched off, the dipoles become disoriented. However, this does not 
occur instantaneously, i.e., the order decreases in accordance with an 
exponential law. The rate of this decrease is described by the relaxa- 


tion time t—the time in which the polarisation decreases to — of ils 
P 


original value. If t is much greater than the oscillation period, the 
direction of the external field changes before the dipoles change their 
orientation. A very rapidly varying field does not affect the behaviour 
of the dipoles at all. If t < 7, at each instant the state will be in 
equilibrium and the polarisation will closely follow changes in the 
field. For most dielectrics, the relaxation time is of the order of 
AO =10529) “secn. i 


_ Examples. 1. Using the results of the example on p.656 let us deter- 
mine the values of the inherent polarisability a and the orientation po- 
larisability b for benzene and water. 


a=3R/40N 4v; whence apenzene =10-23 Gaussian unit; 


water = 0.14% 10-23 Gaussian unit. 
On the other hand, 


a+ b=3P/40N Av, (@+b)benzene= 10-23 Gaussian uni t 


(a+ b)water=0.7Xx10-23 Gaussian unit. 
It follows that 
boenzene=0 and bwater=0.7X10-23—0.14x10-23 = 
=0.56x10-23 Gaussian unit. 


This means that a benzene molecule does hot have a rigid dipole moment, 
but a water molecule does. 


2. Let us determine the rigid dipole moment of an er molecule from the 
= a 


formula p= //3kTb. If the molecular polarisatio: d the molecular 
refraction R are measured at room temperature (T ' 


p= V3 X 1.38 x 10716 x 300 x 0.5 X 10-23 —0.8% 10-48 Gaussian unit. 


This value is in close agreement with experimental results. 
Frequently, the unit 1 debye = 10-18 Gaussian unit is used as a measure 


of dipole moment. This unit is named after the German scientist Debye, who 
developed the theory of dipole moments. 


261. Additivity of Molecular Refraction 


The refraction R is a molecular constant. R does not depend on 
the density or phase of a substance (this has been demonstrated exper- 
imentally), nor on the temperature of the given substance. A conven- 
ient property of refraction is its additivity. If it is possible to com- 
pile a table of property increments* for all possible atoms, and if 
the magnitude of the property is determined as the sum of the incre- 
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ments, such a property is said to be additive. The additivity of R 
can be used for analytical and identification purposes. (It should be 
noted that this additivity possesses no theoretical basis and in a num- 
ber of cases there occur significant deviations from the ideal.) 

Innumerable observations have been processed by numerous inves- 
tigators, and tables of R increments have been compiled. (Most of 
these tables are for Rp increments, i.e., measurements of the index 
of refraction for the so-called D-line, the yellow line of sodium.) 
For example, for C, H and Cl atoms, the increments are equal to 
2.418, 1.100 and 5.967, respectively. By means of these values alone, 
one can predict the molar refraction of many compounds: 

methane, CH,: R=2.418+-4 x 1.100; 

chloroform, CHCl: 418+-1.100-+3x5.967; 

carbon tetrachloride, CCl R=2.418+4-4%5.967, 
ete. Refractions can be measured with great accuracy and when neces- 
sary extremely small differences can be determined, 

In view of dispersion anomalies, which, as was explained earlier, 
occur at frequencies close to the 
natural frequencies of absorption, 
refraction should be measured in 
a region far removed from the 
absorption bands. 

Indexes of refraction are meas- 
ured by means of refractometers. 
Most refractometers measure the 
angle of refraction of a beam of 
light (emerging from a material 
under investigation) which impinges 
on the surface of a prism made of 
glass with a higher n. 

If a bundle of rays with an angle 
of incidence of 0° to 90° reaches 
the boundary between the mate- 
rial under investigation and the 
glass, the refracted rays will lie 
between 0° and a certain critical 
angle a the sine of which will be 
equal to the ratio of the index of 
refraction of the material under in- 
vestigation to that of the glass of 
the prism (see Fig. 286). The criti- 
cal angle is indicated by a sharp 
line in the focal plane of the tube. Fig. 286 
ributions of a given atom to the value of a given phy- 


* These are the cont 
sical quantity. 
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To measure the index of refraction of a liquid, we must place 
a 0.5-mm layer of it on the surface of the prism. When measuring 
solids, the material must make close contact with the surface of the 
prism. Optical contact is achieved by using drops of an appropriate 
liquid between the surface of the prism and that of the material 
under investigation. The index of refraction of a powder may be 
determined by immersing the powder in a liquid whose index of 
refraction is the same as that of the powder. 


262. Pyroelectrie and Piezoelectric Materials 


A crystal which does not have a centre of inversion included in 
its symmetry elements may possess a number of interesting proper- 
ties. Such crystals may have an electric moment (polarisation vector) 
in the absence of an external field. 

First, let us direct our attention to crystals which become polar- 
ised with homogeneous deformation. This property is characteristic 
of piezoelectric crystals, which were discussed in Sec. 45. 


The occurrence of polarisation on com- 


z pression, extension, etc., shows that ho- 
4 mogeneous deformation results in the 
teed creation ofa special, i.e., single (not mul- 


tiplied by the number of symmetry ele- 

ments) direction. Such behaviour is not 

possible when a crystal has a centre of 

LA] inversion. Homogeneous deformation can- 

hag) not eliminate a crystal’s centre of inver- 
sion. At the same time, the existence of a cen- 
tre of inversion is incompatible with the 
existence of a special direction 


tion such as the 

polarisation vector direction. Any crystal 

that does not have a centre of symmetry 

may possess piezoelectric properties. Nev- 

ertheless such properties are not found in 

many crystals of this type. This may be 

Fig. 287 due to the fact that instruments are not 

sufficiently sensitive. However, we may 

conceive of a noncentral symmetric structure in which a homoge- 

neous deformation does not displace the centre of gravity of 

positive charge relative to the centre of gravity of negative charge. 

Close examination shows that the piezoelectric effect is not possible 
in one of the noncentral symmetric groups of symmetry. 


* This is a deformation in which all volume elements are deformed in the 
same manner. 
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The most common piezoelectric material is quartz. Fig. 287 
shows how piezoelectric plates may be cut from a quartz crystal. 
The nature of atom displacements can be assessed from Fig. 288. 
The structure of quartz may be pictured as a compact packing of 
oxygen ions in the empty spaces of which silicon atoms are located. 
(In the figure, silicon atoms are represented as black spheres and oxy- 
gen atoms as white spheres.) The oxygen ions carry a negative charge 
and the silicon ions a positive charge. A silicon atom is surrounded 
by four oxygen atoms. Electrisation of quartz occurs when it is com- 
pressed along the polar axes. Pressure applied along axes lying in 
the plane of the figure results in the dis- ` 
placement of its positive charge relative 
to its negative charge. Pressure along an 
axis of the third order (a nonpolar direction 
perpendicular to the plane of the figure) is 
ineffective. 

Atom displacements cannot be shown in 
the figure. These displacements are quite 
negligible and cannot be detected by objec- 
Live methods (e-g., X-ray structural analysis). 
The piezoelectric constant of quartz, i.e., 
the magnitude of the polarisation vector 
for unit pressure, is equal to 6.5X 10-8 Gaussian unit. The volume of 
a unit cell of quartz is equal to 112x 10-24 cm*. Since there are three 
SiO, molecules in a cell, the number of molecules in a unit volume is 
equal to 2.7 X 10% and, therefore, the dipole moment per molecule 
for unit pressure is equal to 2.4 X 10-8° Gaussian unit. The charge 
of a molecule is equal to 14 + 2 X 8 = 30 electron units. Therefore, 
the displacement of the centre of gravity of positive charge relative 
to the centre of gravity of negative charge is a negligible quantity 
of the order of 107" cm. 

Piezoelectric crystals include a class of materials known as pyro- 
electric crystals. Such crystals are naturally polarised under normal 
temperature and pressure. Usually this effect is masked by the free 
surface charge which accumulates along the boundaries of the crys- 
tal, but it may be detected when the temperature of the crystal is 
raised. Hence the designation pyroelectric (pyro means fire). 

Pyroelectric crystals have even more restricted symmetry. Only 
a crystal having a special axis can be termed pyroelectric. Thus, the 
mere absence of a centre of symmetry is insufficient. The significance 
of this condition is evident. The presence of natural polarisation 
indicates that such a special direction is present in pyroelectric crys- 
tals, while in the case of piezoelectric crystals such a direction 
appears only under the action of mechanical deformation. One of 
the most common pyroelectric substances is tourmaline. 


Fig. 288 


E 
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A pyroelectric crystal has a very strong internal electric field. 
Therefore, the superposition of an external field does not change the 
polarisation of such a crystal, i.e., the polarisation cannot be in- 
creased, decreased or rotated. Such a crystal is polarised to saturation — 
all particle dipole moments are parallel. In the theory of ferromagnet- 
ism (see Sec. 266, which may be read with profit at this point). 
a region in which the magnetic moments of atoms are parallel is 
called a domain. The same term is applied to a region in which the 

electric dipole moments of all particles 

are parallel. A pyroelectric crystal usu- 

Erea ally constitutes a single domain, but there 

are exceptions. A number of substances 
may yield multidomain crystals. Such 
pyroelectric substances are known as 
Seignette electric, or ferroelectric, sub- 
stances. The former designation is de- 

LEZ rived from Seignette salt, in which B. V. 
Kurchatov and P. P. Kobeko first detect- 

esa O0 On ed this effect. The latter designation 
emphasises the close similarity between 
these substances and ferromagnetic ones. 
Both ferromagnetic and ferroelectric sub- 


Fig. 289 


stances have very large dielectric constants (values of several- 


hundred or even thousand), pronounced hysteresis effects, and 
Curie points. The discussion of Sec. 266 is completely applicable 
here and will not be repeated. The points made about the effect of 
a field, polarisation by the displacement of domain boundaries, and 


the reasons for the division of a crystal into small domains all apply 
to ferroelectric materials as well. 


A large number of ionic pyroelectric crystals are ferroelectric. 
Several of these are particularly suitable to demonstrate ferroelectric 
properties. Barium titanate, BaTiO;, is typical in this respect. 
At 120°C, barium titanate loses its special properties and becomes an 
ordinary dielectric. At temperatures above 120°C, this substance has 
the simple unit cell shown in Fig. 289. The cell is cubic; at the centre 
there is a titanium atom, at the cornersof the cube barium atoms, and 
at the face centres oxygen atoms. The cell has a central symmetric 
structure; above 120’, the crystals no longer exhibit pyroelectric prop- 
erties. When the temperature is reduced a phase transition occurs 
and the structure changes: one of the cube edges becomes 1 per cent 
longer than the other two and the cube is transformed into a tetra- 
hedron. In this process, the titanium atom is displaced in the direc- 
tion of one of the oxygen atoms. This now becomes the special direc- 
tion and the polarisation vector will be parallel to this line. A bari- 
um titanate crystal has three directions of weak polarisation, rather 
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than one, since the displacements along the three axes of the cube 
are equal. 

When the substance is cooled below the Curie point (120°C for 
a barium titanate crystal) different regions of the crystal may be 
transformed into domains with different orientations. A crystal which 
has acquired a domain structure is in a state of mechanical stress, 
i.e., some portions of the crystal are compressed and others extended. 
Strictly speaking, a domain crystal is not a monocrystal—three-di- 
mensional long-range order throughout the crystal] is no longer present- 

On further reduction of temperature, barium titanate undergoes 
yet another phase transformation at about -+10°, but it does not 
cease to be ferroelectric. 

Seignette salt behaves differently. It possesses ferroelectric proper- 
ties only within a narrower temperature interval—from —20°C 
to + 24°C. s 


CHAPTER XXXVI 
MAGNETIC SUBSTANCES 


- 263. Three Groups of Magnetic Substances 


Substances may be classed into three groups in accordance with 
their magnetic properties: diamagnetic, paramagnetic and ferromag- 
netic. The values of diamagnetic susceptibility lie in the range of 
—13 x 40-® (bismuth) to —0.8 x 10-8 (copper). Paramagnetic bod- 
ies are characterised by positive susceptibility—for example, 0.4 x 
Xx 10-8 (potassium) and 320 x 40-® (iron chloride). Ferromagnetic 
bodies are characterised by large values of permeability. These are 
hundreds and even thousands of times greater than those of other 
bodies. Let us examine the structural features which explain these 
great differences in magnetic properties for substances which other- 
wise do not show great differences in properties. 

Diamagnetism, it will soon be seen, is a universal property of all 
bodies inasmuch as they consist of electrons. The above values show 
that diamagnetic properties are weaker than paramagnetic ones and, 
a fortiori, weaker than ferromagnetic properties. Diamagnetic proper- 
ties may be detected only in the absence of properties resulting in 
positive magnetism. Paramagnetic and fer agnetic bodies have 
diamagnetic properties, but they are obscure l by the stronger positive 
paramagnetism. Thus, diamagnetism exists for any system containing 
electrons. On the other hand, positive magnetism arises only in 
bodies the atoms of which possess a magnetic moment. The phenom- 
enon of paramagnetism is very similar to. the process of clectrisa- 
tion of a dielectric, which consists of rigid dipoles Possessing a con- 
stant dipole moment. 

The presence of a magnetic moment in atoms is also a necessary 
condition for the existence of ferromagnetic properties. However, the 
peculiarities of ferromagnetic substances are due to a very specific 
property, viz., the formation within a body of vast regions—do- 
mains—within which the magnetic moments of thousands of millions 
of atoms are arranged parallel to one another. 


264. Diamagnetism 


Diamagnetism is a direct consequence of the tendency for an elec- 
tron to move in a circle in a magnetic field. 
In a magnetic field with an induction B, an unbound charged par- 


s ; K ' eB 
ticle moves in a circle with an angular frequency Op. Tt can 
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be rigorously proved that the action of a magnetic field on an elec- 
tron moving in a central field—in particular, in the field of an atomic 
nucleus—produces an analogous effect: the electron will move in 
a circle about a line of force, but at one-half the frequency, viz., pa ; 
ne 
This motion is superimposed on other motions which may be per- 
formed by the electron: the chaotic motion of particles of the electron 
gas or the motion of the electron about an atomic nucleus. 

The fundamental considerations discussed on p. 494 showed that 
such motion may be equated to a circular electric current. When the 
magnetic field is switched on, the electrons begin to rotate about the 
magnetic field and each produces an 
elementary current yi 


Multiplying this value by the area of the = 

circle described by an electron in its 

motion about a line of force, we obtain U 

the value of the diamagnetic moment I 

created by one electron: 
Wise 1 eo we 


c QT 4ume? 


e2 


he reason for the minus sign is clear 
from Fig. 290: the direction of the moment 
is opposile to that of the field. 

When a system consists of a large num- 
ber of electrons, we must take the summation of the above 


expression with respect to all the electrons: 


ə 


M= ~~ Zame? » SB. 


Since by definition (see p. 290) magnetic susceptibility is equal to 
the ratio of magnetic moment per unit volume (or unit mass or mole) 


to induction, 
Nee wy 
pare DA 4nme? 2 Si. 


If N is Avogadro’s number, x represents molar diamagnetic suscep- 


B 
Fig. 290 


tibility (in comparing with the results on p. 290, note that x ==) s 


Thus, y is given by the areas circumscribed by electrons in their 
secondary motion in the magnetic field. In principle, this computa- 
tion can be made if we know the wave function ofthe system, i.e., in 
ihe final analysis, the electron density. Actually, since the computa- 


668 Magnetic Substances 


tion is very cumbersome, the diamagnetic susceptibility is deler- 
mined experimentally. 

It should be emphasised that diamagnetic susceptibility is deter- 
mined by the electron structure of the system and does nol depend 
(at least for atoms and molecules) on external conditions, including 
temperature. 

Diamagnetic susceptibility, like molecular refraction, possesses 
additivity. If the diamagnetic susceptibility is taken for a mole of 
substance, the susceptibility x of a molecule may be expressed with 
considerable accuracy as 


ioe > NAKA: 


where n4 is the number of atoms of type A in the molecule and Ya 
is the increment for the given atom. For purposes of illustration, we 
can use the same example as for refraction (see p- 664). C, H and CI 
atoms have the increments 7.4, 2.0 and 18.5 (xa X 10°), respective- 
ly. Thus, we obtain 15.4 for methane, 64.9 for chloroform, and 81.4 
for carbon tetrachloride. These values are in close agreement witb 
experimental results. 

À The significance of this additivity consists probably in the follow- 
ing: outer electrons weakly affect diamagnetic susceptibility. In 
so far-as additivity is realised, diamagnetic susceptibility is an 
atomic rather than a molecular property. 

Diamagnetic susceptibility, as indicated in the preceding article, 
is a property associated with substances the atoms and molecules 
of which do not have a constant magnetic moment. Such particles 
include in the first place atoms and ions with completed shells— 
the ions F~, Cl- and Nat and atoms of the noble gases. Atoms and 
ions which in addition to a completed shell contain two more s-clec- 
trons with anti-parallel spins, e.g., Zn, Be, Ca and Pb*+, are also 
diamagnetic. s Ki 

The group of diamagnetic molecules is incomparably larger than 
the group of paramagnetic molecules. The latter exists more in the 
nature of exceptions. This is due to the fact that practically all mole- 
cules have valent bonds formed by a pair of electrons with anti-par- 
allel spins. Usually, the total moment about a nucleus, as well 
as the spin moment, equals zero in such molecules. Thus, bodies 
consisting of atoms and ions such as those cited above and practical- 
ly all bodies the building blocks of which are molecules— therefore, 
practically all organic substances—are diamagnetic. 

Diamagnetic susceptibility describes the electron cloud of a mole- 
cule. If the distribution of electrons in a molecule is strongly aniso- 
tropic, its magnetic susceptibility is also anisotropic. The anisotropy 
of diamagnetic susceptibility is manifested particularly in molecules 
of the aromatic compounds. For example, in benzene, Xi the molar 
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diamagnetic susceptibility in a direction lying in the plane of a ben- 
zene ving, equals —37 Xx 107° cmë/mole and xy, the molar diamagne- 
tic susceptibility in a direction perpendicular to the plane of a ring, 
equals —91 x 10-® cm*/mole; in naphthalene yj = —40 x 

10-5 cm3/mole and yı = —190 x 107° cm’/mole. Anisotropy may 
he detected by measuring crystals oriented in different directions in 
the field. Measurements of powders, liquids and gases yield a value 
of magnetic susceptibility for an averaged orientation. 


265. Paramagnetism 


A substance has paramagnetic properties if the atoms, ions or mole- 
cules of which it consists possess a magnetic moment. A magnetic 
moment is due either to the uncompensated spins of electrons in the 
atomic system or to the motion of electrons about nuclei, or both. 

As was explained earlier (see p. 498), a magnetic moment resulting 
from spin is related to angular momentum as follows: 


Us = 2}ty Vs(s+), 


and a magnetic moment resulting from the motion of electrons about 
a nucleus is related to angular momentum as follows: 


Lp = us V D(L+ 1). 


Here, tty is the Bohr magneton and s and L are, respectively, the 
total spin momentum and the total angular momentum for motion 
about a nucleus, taken for an atom or molecule as a whole. As pre- 


š . h 4 
viously, s and Z are expressed in units of an When paramagnetism 


is due to both effects, the formula for the magnetic moment of an 
atom or molecule takes the form 


u=gun VJ (J +1), 


where J is the quantum number of the total quantum momentum, 
i.e., the vector sum of Zand s, andg is the Lande factor, which depends 
on all three quantum numbers. Incidentally, the proximity of g to 
1 ori 2 (established experimentally) is an excellent indicator of the 
origan of the magnetism of a given substance. 

Pramagnetic atoms and ions include particles having one electron 
pleted shell (e.g., atoms of the alkaline metals), 


over and above a com] o 
atoms of the transition elements, ions of the rare earth elements 


with incomplete shells, ete. — i i 
Most molecules, as already indicated, are diamagnetic. Molecules 


of oxygen and sulphur, which are paramagnetic, are exceptions and 
have a total spin equal to 4. The magnetic moment obtained experi- 
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mentally is in close agreement with the value calculated by means 
of the formula 


Brie TD: 
= 2p, V 2. 


The presence of paramagnetism is proof of the fact that the mole- 
cules contain unpaired electrons. This circumstance makes the meas- 
urement of the magnetic properties of molecules of great interest 
to the chemist. The so-called free radicals, which are chemical com- 
pounds with an unpaired electron, possess paramagnetic properties. 
Free radicals are created in a number of instances in chemical reac- 
tions, and the measurement of magnetic susceptibility is a possible 
method of studying the course of chemical reactions. 

How is the value of the paramagnetic moment of a molecule relat- 
ed to that of magnetic susceptibility? In paramagnetic bodies located 
outside a magnetic field, the magnetic moments are distributed ran- 
domly with respect to direction, and the total magnetic moment of 
a substance is equal to zero. When a field is switched on, the atoms 
(or molecules) will tend to rotate in such a way that their magnelic 
moment coincides with the direction of the field. As a result, equi- 
librium is established between two tendencies: the ordering action 
of the field and the tendency to thermal randomness, The reasoning: 
used on p. 656 to derive the value of the polarisability of a substance 
consisting of rigid electric dipoles is completely applicable here- 
Therefore, like in that case, the relationship between the magnetic 
moment of an atom (or molecule) and the paramagnetic susceptibility 
of an atom is given by the expression 

ale 

Yatom = ZET ` 
In contradistinction to diamagnetic susceptibility, the paramagnet- 
ism of a substance depends on temperature. To be sure, the situation 
here is‘somewhat more complex than in the case of dielectrics. This 
is due to the fact that the electric moment of a molecule is a constant y 
while the magnetic moment of a molecule (or atom) may vary consid- 
erably with the temperature. Paramagnetic moment is related to 
quantum numbers, and the distribution of molecules according to 
state may depend greatly on temperature. Therefore, the simple law 
that magnetic susceptibility is inversely proportional to temperature 
(the Curie Law) may not be valid in the case of paramagnetic sub- 
stances. 


266. Ferromagnetism 


Domain. A small number of substances possess marked (using coarse 
observation methods) magnetic properties. These substances 
include iron, cobalt, nickel, gadolinium, compounds of these ele- 
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ments, and certain compounds of manganese and chromium. Since 
iron is the most important of these, such substances are said to be 
ferromagnetic. 

Atoms of a ferromagnetic substance have a magnetic moment, 
which, moreover, is caused by spin (at least, basically). However, 
it is not this feature that distinguishes it from a paramagnetic sub- 
stance. The main characteristic of a ferromagnetic substance is its 
domain structure. A domain is a 
region which is magnetised to 
saturation, i.e., a region in which 
all atoms are arranged with their 
magnetic moments parallel. Since 
the linear dimensions of do- 
mains are usually of the order of 
0.01 mm, they may be observed 
by means of an ordinary micro- 
scope. 

Domains exist in a ferromag- 
netic substance in the presence 
as well as in the absence of a 
field. In order to observe do- Rin. 291 
mains, we place a drop of colloidal Bee 
suspension—a finely divided sub- 
stance such as magnetite (Fe,;0,)—on the polished surface of a ferro- 


magnetic monocrystal. Colloidal particles become concentrated close 
to the boundaries of the domains since strong local magnetic fields 
exist along such boundaries (as in the case of any bar magnet) and 
they attract the grains of magnetite (see Fig: 291). 

First, let us consider certain problems arising in connection with 
one domain; then we shall study the arrangement of domains in crys- 
tals; and finally we shall examine the process of magnetisation of 
a ferromagnetic substance. poe 

The orientations of the magnetic moments of atoms forming a sin- 
gle domain are not arbitrary. Every crystal of aferromagnetic substance 
has a particular crystallographic direction along which it is most 
easily magnetised. In hexagonal cobalt this is a single direction—the~ 
hexagonal axis. In cubic iron this direction is the edge of a cube. 
This means that there are three directions of easiest magnetisation 
and accordingly three directions of magnetic moments of domains. 
In cubic nickel the spatial diagonals of a cube are axes of easiest 
magnetisation, i.€., there are four possible directions of magnetic 


moment. 4 j 
Why is it that atoms in a ferromagnetic substance arrange them- 


selves so that their magnetic moments are parallel? This is caused by 
a specific phenomenon—the interchange of positions by electrons. 
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As indicated in connection with a chemical bond, the overlapping of 
wave functions results in a decrease in energy. Electrons occupy 
a common space and are able to interchange positions. The tendency 
of exchange energy to become minimal is the reason for the stability 
of most chemical compounds. Exchange energy plays an analogous 
role in the creation of a domain. In the case of a chemical bond, the 
minimum value of exchange energy is achieved when the spins of 
interchanging electrons are anti-parallel. However, the general con- 
clusion of quantum mechanics is broader, i.e., the exchange energy 
may in certain cases be minimal for parallel orientation of spin and 
in other cases for anti-parallel orientation of spin. In ferromagnetic 
substances, the spins of atoms contained in a domain have a parallel 
orientation. Comparatively recently, a new class of com pounds— 
anti-ferromagnetic substances—was discovered. In these substances, 
stable domain states occur for anti-parallel orientation of spin. 
From measured values of the magnetisation of a domain, one can 
calculate the number of spins per atom involved in ferromagnetism. 
Such numbers are not whole numbers (for iron 2.2, for cobalt 4.7, 
for gadolinium 7.1, etc.). It must be concluded that to a certain 
extent, the electrons forming an electron gas are also involved in the 
creation of ferromagnetism. However, in the main, electrons bound 
to atoms are responsible for ferromagnetism. In iron, conduction 
electrons come from the outer 4s shell, while ferromagnetic electrons 
are in the 3d shell. i 
The existence of remarkable materials known as ferrites constitutes 
direct proof of the absence of any connection between conduction 
properties and ferromagnetism. These materials are semiconductors 
with a specific resistance of 10 to 14 orders of magnitude greater than 
iron. Conduction electrons, of course, play no role in the magnetism 
of these substances. Ferrites are mixed compounds; for example, 
manganese ferrite is a 1:4 mixture of manganese oxide and iron 
oxide, and nickel ferrite is an analogous mixture of nickel oxide and 
iron oxide. Iron oxide contains two iron atoms, and nickel oxide one 
nickel atom. A crystal of the mixture represents a compact packing 
of oxygen atoms. The nickel atoms and the two iron atoms fit into 
the empty spaces. It was indicated on p. 600 that there are two kinds 
of empty spaces in a compact packing arrangement, viz., tetrahe- 
dral and octahedral. An atom which fits into an empty space of the 
first kind is surrounded by four neighbours, while an atom in an 
octahedral space has six neighbours. The iron atoms fit into both 
kinds of spaces. The magnetic moments of the iron atoms are quite 
ordered, but the moments of iron atoms in tetrahedral spaces point 
in one direction while the moments of iron atoms in octahedral spaces 
point oppositely. As a result, the actions of these two systems 
of moments cancel each other and the magnetic properties of such 
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a mixed oxide result from the magnetism of nickel, the moments of 
whose atoms are all pointed in one direction. 

The presence of exchange energy explains the tendency of atoms 
to arrange Lhemselyes so that their spins are parallel or anti-parallel. 
Apparently, in ferromagnetic 
substances, Lhe exchange en- 
ergy of interaction becomes 
of prime importance and 
causes the substance to havea 
spin arrangement such that 
the energy assumes a mini- 
mum value. In the remaining 
paramagnetic substances, oth- 
er components of interaction 
energy do not allow the ex- 
change energy to make itself 
felt. 

The long-range order of 
atoms is destroyed at a cer- 
tain temperature: the crys- 
tal becomes fused. Temper- 
ature affects the arrange- 
mentofthe magnetic moments y 
in exactly the same manner. ginaya 
Fig. 292 show schematically 
how the magnetic moments of the atoms behave when 
the temperature is raised. At first the vibrations are in phase, 
then disorder begins to prevail, and finally the magnetic order 
“melts away”. Beginning at a definite temperature called the Curie 
point, in honour of the outstanding French scientist Pierre Curie, 
the order in the arrangement of arrows disappears and the substance 
loses its magnetic properties, i.e., the ferromagnetic substance turns 
into a paramagnetic substance. For iron the Curie point lies at 770°C, 
for cobalt at 1,115°C, for nickel at 358°C and for gadolinium at 15°C. 

In an anti-ferromagnetic substance, the spin of atoms tends to 
assume an orderly, but anti-parallel, arrangement. The structure of 
a domain of manganese oxide, which is an anti-ferromagnetic sub- 
stance, is shown in Fig. 293. Arrows represent the moment of manga- 
nese. From the figure, we see that the chemical period of structural 
repetition is one-half of the magnetic period. At absolute zero each 
atomic magnet of the anti-ferromagnetic substance is surrounded 

i ly directed moments. As in the case of a fer- 


by atoms with oppositely | r , i 
romagnetic substance, this orderis destroyed at a definite Curie 
temperature and above this critical point it behaves like a para- 


magnetic substance. 


43—1409 


674 Magnetie Substances 


The existence of various anomalies in the behaviour of a body in 
passing through the Curie point is indirect evidence of the existence 
of anti-ferromagnetic properties. Since the Curie point is a point of 
phase transition of the second kind} a number of properties undergo 
an abrupt change in passing through it. 

Direct evidence of the existence of anti-ferromagnelic properties 
has been obtained by means of neutron diffraction methods. The 
scattering of neutrons by a lattice (see Fig. 293) is sensitive to the 


Chemical period |<— 
Magnetic period 


Fig. 293 


chemical period, rather than to the magnetic period, of structural 
repetition. 

Domain Structure of a Crystal. In examining the domain structure 
of a ferromagnetic monocrystal by the powder method, which was 
described earlier, we note that a domain is never very large, i.e., its 
linear dimensions are usually no greater than 0.01 mm. It is found, 
moreover, that cubic ferromagnetic substances have extraordinarily 
symmetric combinations of differently oriented domains. These 
two circumstances require explanation since, it would seem, that 
thanks to the ease of magnetisation an entire crystal should be trans- 
formed into a single domain , 

L. D. Landau and E. M. Lifshits have shown that a domain struc- 
ture of the kind shown in Fig. 291 is a natural consequence of the 
existence of different energy forms in a ferromagnetic body. The 
essence of the theory is illustrated in Fig. 294. The first diagram 
corresponds to a single domain, the magnetic energy of which is 
4 


EF f H?’dı. But the energy corresponding to the second configuration 
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is only one-half of this value. In the case of N parallel domains, the 
sale ; 

energy will be about W of that for a single domain. However, this 
dividing process will be advantageous only up to a certain point. 
Beyond that point, the energy of boundary layers exceeds the decrease 
in energy associated 

with the division of a ` AAN 

crystal into domains. The NN NN WN S'S 

advantage of configura- 

tions which consist of 


circuits is evident. In 
such cases, a closed mag- 
netic flux circuit is 
formed and the energy of SS 8S SS NN 
the field outside the crys- A 
tal equals zero. 

In the case of cobalt, 
which has a magnetisa- 
tion direction along its 
axis, we encounter do- 
mains in which the mo- 

ments are oriented only 
along the axis ofa hex- 
agon. The zero magnet- 
ic moment of a body 
in the absence of an 
external field is realised 
as follows: half of the 
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domains have one orientation and the other half the op- 
posite orientation. 

A few words regarding the boundary between domains. This bound- 
ary layer is shown schematically in Fig. 295. We see that in this 
layer the magnetic moments gradually change direction. The thick- 

i ness of the layer is determined by the requirement for minimum 
energy. Two opposite tendencies occur here. On the one hand, it is 
desirable to extend over a thick layer—this being of greater advantage 
with respect to exchange energy—the disadvantageous process 
of spin turning. On the other hand, itis better to complete this proc- 
ess rapidly since in the transition layer the spins are at an angle 
to the directions of easiest magnetisation. 

Now, let us consider what happens in a ferromagnetic substance 
when an external field is switched on. The magnetisation process 

by the powder technique. It transpires that the 

f magnetisation consists in the growth of a domain, 
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may be followed 
| basic mechanism 0 
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which points in the “required” direction, by means of boundary dis- 
placement. The-domains which form an acute angle with the field 


SSSA 


Fig. 295 


“swallow up” those which form an obtuse angle with the field. At 
the beginning of magnetisation, domain boundary displacement is 


Fig. 296 


reversible, i.e., when the field is switched off 
the initial boundaries of the domains are re- 
stored. Later on, domain boundary displacement 
becomes irreversible. Finally, when a very high 
degree of magnetisation is reached, the direc- 
tion of magnetisation of the domains be- 
gins to turn. This is illustrated in Fig. 296. 
In polycrystalline substances, the situation 
is exactly the same (assuming the crystal par- 
ticles are not extremely small since domains 
are not formed when the size of the particles 
are less than 10-° cm), i.e., each grain may 
consist of several domains. In so far as the 
crystallographic axes of the particles have 
random orientations in the body, however, 
the magnetic moments of the domains orient 
themselves randomly. Thus, the simple mag- 
netisation diagrams which have come down 
to us from the days of Ampère provide a cor- 
rect picture of polycrystalline substances. 


CHAPTER XXXVII 
EFFECT OF ELECTRON STRUCTURE ON PROPERTIES 
OF BODIES 


267. Free Electrons 


Until now, in discussing the structure of solids and liquids, we did 
not pay particular attention to the role of electrons in the formation 
of the properties and structure of these bodies. We were able to do 
this because the electron structure of bodies is by no means always 
of prime importance. Ina number of cases, however, the role of elec- 
trons must be taken into account. There are two “kinds” of electrons 
in a body, viz., bound electrons and unbound (free) electrons. Bound 
electrons are component parts of a specific atom, ion or molecule. 
Unbound electrons belong to the entire crystal or liquid and may 
move quite freely between atoms. 

In molecular substances, the picture of electron structure is 
particularly clear. In most cases, there are no common electrons, 
i.e., none of the electrons leaves the “bounds” of a molecule. In 
ionic crystals, the restriction of electrons is not quite so clear. Even 
according to the classical view ofan ionic bond, it cannot be assumed 
that electron exchange is completely absent. Nevertheless, elec- 
trons passing from ion to ion (exchange electrons) in ionic crystals do 
not behave like free electrons; their displacement in such a crystal 
consists in the transfer of an electron from one atom to its neigh- 
bour. This is quite clear in crystals having a homopolar bond. Diamond 
is an insulator, although the electrons binding the carbon atoms 
are by no means restricted to specific positions, but are relayed from 
alom to atom. 

Metals differ quite considerably from all of the bodies mentioned 
above. Here, we encounter electrons for which the term “free” is 
entirely justified. Electrons are displaced in a metal just like gas 
particles in a tube filled with obstructions. Atomic products (ions) 
in a state of thermal vibration act as obstructions. The presence of 
free electrons is revealed primarily in conduction phenomena as well 
as all experiments involving the escape of electrons from a body. 
This type of phenomenon could not be explained without considering 
the peculiar behaviour of common electrons. 

It would be incorrect, of course, to assume that the division of 
electrons into bound and free electrons is absolute. This may rather 
be considered an idealised division. In solids we may encounter elec- 
trons which are bound in various degrees. This became particularly 
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evident when physics assigned a rightful place to semiconductors, 
which occupy an intermediate position between a system of ideally 
free electrons and a system of exchange electrons, or a system of 
electrons bound to molecules. It is now known that any transi- 
tional type of structure is possible. 

It should be noted that an electron in a solid, just like an atomic 
electron, obeys the laws of wave mechanics. The representation of 
an electron as a spherule has validity within the limits imposed’ by 
the principle of uncertainty. Usually it is physically meaningless 
to speak of the path of an electron inside a metal. A description of 
the electron structure of a body consists primarily in indicating 
how its electrons are distributed according to energy. 

Theory shows that the electrons of a body may be represented as an 
electron gas, but this statement must be properly qualified. It tran- 
spired that it is possible to picture the electrons of a metal as a gas 
of fictitious particles, the effective mass of such a particle being 
dependent on its direction of motion. This point is made in order to 
caution the reader against making a superficial analogy between an 
electron gas and a gas consisting of molecules. 


268. Energy Levels in a Solid 


The energy levels of a free atom were discussed earlier. Such 
energy levels may be determined experimentally, i.e., by observing 
the energy transitions which occur when light is emitted or absorbed. 
When an atom possesses many electrons, a unique group of four 
quantum numbers is associated with each electron; according to the 
Pauli exclusion principle only one electron may exist in a given quan- 
tum state. Therefore, energy levels have a limited capacity. The s 
levels of an atom may contain two electrons, the p levels six, etc- 
This information may be determined experimentally and as a conse- 
quence of the fundamental laws of quantum mechanics. 

To determine the energy levels of a system consisting of a large 
number of atoms, we should use both approaches here as well. The 
fundamental theoretical concepts remain unchanged in the case of 
a system consisting of thousands of millions of atoms. Therefore, it 
may be concluded that the number of quantum states in a system 
consisting ofn atoms will be z times the number in a free atom. Thus, 

_ the Pauli exclusion principle can be satisfied: only one electron 
will exist in a given quantum state. 

An atom never completely loses its identity in a given body. Spec- 
tral investigations indicate that significant changes affect only outer, 
valence electrons, which are vesponsible for interaction between 
atoms. Therefore, the quantum states of a solid must be closely relat- 
‘ed to the quantum states of an atom. Let us consider, for example, 
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the electrons of a K shell, which is closest to the nucleus of an atom. 
It is evident that the state of such an electron can change only very 
little when atoms are combined in a body. Nevertheless, the Pauli 
exclusion principle does not allow us to consider all Æ electrons as 
equal. It becomes necessary to assume that z extremely close K 
levels, cach of which consists of a pair of electrons of opposite spin, 
exist in a body consisting of n atoms. 

This reasoning also carries over to the other energy levels. It is 
assumed that the quantum states of a body are related to that of an 
atom by the following rule: 

a body consisting of n atoms 

has n times as many energy 3R 
levels as an individual atom. 
Each level of a free atom 
yields r close levels in a solid 
body. This means that the ener- 
gy levels of a body may be 2P 
viewed as a system of bands. 25 
Bach band is a split level of 

an atom. Therefore, the same 
designations may be used for 

the band as in atomic spectro- 1S 
scopy: 1s, 2s, 2p, etc. The 
number of electronsin a band 1/r 
will be, of course, n times the 
number of electrons in the 
corresponding shell of an atom. 
Thus, in the 4s and 2s bands there will be 2n electrons, in the 2p 
band 6n electrons, ete. 

The width of a band depends on the interaction forces between 
atoms. This concept is illustrated schematically in Fig. 297. The 
energy levels of a sodium atom are shown at the left and the expan- 
sion of the levels into bands in the formation of a crystal lattice are 
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shown at the right. The quantity £ is plotted along the abscissa. 


Perceptible expansion of the 1s level does not occur since the required 
interatomic spacings are absolutely unrealisable. The 2s and 2p 
bands are also practically unexpanded under normal conditions 
(indicated by a dotted line). On the other hand, the 3s and 3p bands 
are expanded to such an extent that they overlap. This means that 
the interaction which occurs between sodium atoms under normal 
conditions affects only outer electrons. (Sodium has no electrons in 
the 3p state. Nevertheless, we shall also be concerned with unoccu- 
pied energy levels when the exciting energy is sufficient to transfer 
an electron to such a level.) / 


680 Effect of Electron Structure on Properties of Bodies 


What is the significance of the overlapping of the 3s and 3p bands? 
Actually, our scheme of correspondence between the energy levels of 
an atom and a solid fails in this case. However, we shall not let this 
disturb us. The overlapping of the bands signifies thal the wave func- 
tion properties of an electron in the overlapping region differs from 
the wave function properties of an atomic electron. Thus, the outer 
electron of a free sodium atom is an s electron. In liquid and 
solid sodium the 3s and 3p bands overlap; the behaviour of the outer 
electrons of sodium differs from the behaviour of an s electron, i.e., 
certain special (hybrid) properties appear (such electrons reflect the 

peculiarities of s and p wave func- 
nle) tions). 

The described behaviour may be 
established experimentally by means 
of spectral methods. The- presence of 

E — an energy band rather than a distinct 
energy level can be established by 
n(é) examining the transition of electrons 
from a high band to a low one. That 
which would have produced a sharp 
line in the case of a free atom now pro- 
== duces a broad spectral band. 

It is more convenient to examine 

? transitions from an energy band to a 
nie) single distinct level—for example, in 
the case of sodium, transitions to the 
2p level. The spectral band obtained 
in this manner enables us to deter- 
mine the width of the energy band as 

Fig. 298 well as the electron distribution ac- 

cording to energy. In the caseof sodium, 

thisrequires that electrons be dislodged from the 2p shell. The frequen- 

cies of the resulting transitions lie in the region of soft X-rays (sev- 

eral hundred Angstroms) and are very difficult to detect. Special 

X-ray tubes, in which the anode seryes as the material under in- 
vestigation, are used in such studies. 

From measured values of the intensity of the obtained spectral 
band, we may plot a curve of intensity as a function of the frequency 


E — 


v. Butv = L (where & is the transition energy, i.e., the energy rela- 


tive to the distinct level), and the intensity at a given v is proportion- 
al to the number of electrons having an energy ĝ. Curves of n (é) 
as a function of &, where n (€) is the fraction of electrons in the 
band having an energy between € and @ + dg, may be plotted 
from experimental data. Three typical curves are shown in Fig. 298. 
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The first curve corresponds to an energy band in which the maximum 
energy is sharply defined. This signifies that all lower energy levels 
are occupied. The abrupt drop in the curve indicates that the lower 
levels are filled to capacity (two electrons per level). The second curve 
is typical of elevated temperatures. In this case, the edge of the band 
is smeared and the order of magnitude of the width of the smeared 
region is equal to #7. This means that some of the electrons are in 
an excited slate and can occupy higher levels. The third curve, which 
shows two nonoverlapping bands, is very interesting. The lower 
band is filled and the upper one is partially occupied. A forbidden 
band is located between the two allowed bands. 
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It is evident from the preceding article that as far as solid-state 
theory is concerned only the upper energy bands are of interest, since 
electrons at lower levels practically do not take part in interactions 
between atoms. How can the behaviour of upper band electrons be 
described? Since we are dealing with a very large number of electrons. 
it is natural to use statistical physics methods and consider an aggre- 
gate of such electrons as a kind of gas. 

The state of each electron of such a gas may be represented by 
a point (Px, Py, Pz) in momentum space. The direction of motion of 
an electron is parallel to its radius vector p and the energy of an 
electron depends on ils momentum. In a crystal, the energy of an 
electron will depend on its direction of motion. Let us disregard this 
for a moment and assume that the electrons behave like free parti- 
cles. Despite the fact that this is a rough approximation, i.e., that 
we neglect the potential energy of the field in which the electrons 
move and the interaction between electrons, the results give us a good 
description, at least qualitatively, of the behaviour of the electrons 
of a solid which form an energy band. 

If the electrons are free, the relationship between their energy and 


i 5 T E A 
momentum is given by the formula ẹ = >—p*. This means that in 
> 2m 


momentum space a surface of equal energy is a sphere. It is custom- 
ary to call such a sphere a Fermi sphere, after the famous Italian 
physicist. As indicated in the preceding article Emax, the maximum 
energy of the electrons in a band, may be determined experimental- 
ly. We can say, therefore, that the states of an electron gas are con- 
tained in a sphere of radius Pmax = V 2mM@ max. Thus, it would not be 
incorrect to call this Fermi surface a surface of maximum energy. 

To qualitatively check the validity of this theory, let us estimate 
from the value of @max the number of electrons in a band. We may 
reason as follows. According to the principle of uncertainty, the 


a 
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projection of the momentum of a particle in a metal body of eas 
l 

P 
Therefore, in momentum space, the concept of a point should be 


dimension Æ cannot be determined with greater accuracy than 


replaced by the concept of a cell of volume = ; Where V is the volume 


of the metal body under consideration. One of the basic postulates of 
the theory is that such a cell represents a quantum state and that it 
can contain no more than two electrons of opposite spin. If there are 


3 i 4 3 N 
N electrons in a volume V in the band under consideration, then = 


2 
é N hs tie Ree a 
«cells are occupied, i.e., the volume SF . This is the volume of a Fer- 


mi sphere of radius Pmax- Thus, 


4 Sr N] N h3 
= (V 2m max) a 
_ Perfectly reasonable values of N may be obtained using this equa- 
tion. This means that the above assumptions are more or less valid. 


_ Example. In a metal, the maximum energy, determined experimentally, 
is Cmax ~ 10 ev = 16 Xx 10!* erg. Using this value, we obtain Pmax = 


= V2mEmax~ 2 X 10719 gm cm/sec, i.e., the maximum electron velocity 
in a metal is 


Pmax 2x 10-19 : 
O E “Ixo 2x 108 cm/sec. 
` 


Hence, the number of electrons in a unit volume is 


4 eet i: aN 
N=2 xz T (Vingmax) 7r ~ 1023. 


The abòve discussion assumed that the temperature is at absolute 
zero. At a higher temperature, electrons may pass over into momen- 
tum-space cells which correspond to higher energy. Such a transition 
will take place for electrons located in cells close to a Fermi surface 
(otherwise too high a transition energy is required, which is unlikely 
to obtain) and the boundary of the sphere will be broad (not distinct). 
Only at very high temperatures will the excitation affect low-energy 
electrons. As the temperature is increased, the degree of degeneracy of 
the electron gas decreases. An electron gas has a high degree of degen- 
eracy, particularly at low temperatures. The term “degeneracy” 
signifies that different quantum states have one and the same energy- 

The distribution of electrons according to energy at a given temper- 
ature may be calculated. This distribution differs from a Boltz- 
mann distribution. According to the Boltzmann law, at absolute 
zero, the energy of electrons should be equal to zero. From the view- 
point of the new theory, electrons should have a high energy at 
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absolute zero (this follows from the Pauli exclusion principle).* On 
the basis of the Pauli exclusion principle, we can construct a new 


statistics (Fermi-Dirac statistics), in which the function e 
replaced by the expression 
1 
as 
©— Omax 


kT 34 


e 


where @max is the maximum possible energy of the electrons at abso- 
lute zero. Multiplying this factor by the electron distribution at 
absolute zero yields the electron distribution at any temperature 


Fig. 299 


Fig. 299 shows the dependence of the Fermi-Dirac function on & 
when kT = 0.1 and 2.5 ev. 

It should be noted that different particles obey different statistics. 
Molecules obey Boltzmann statistics, photons Bose-Einstein statis- 
tics, and electrons (and other particles having a spin of 5) Fermi- 
Dirac statistics. 

The difference in statistical approaches consists in the different 
methods of distribution of particles according to their possible states. 

Let us assume that there are two states in which two particles 
may be possibly located. In Boltzmann statistics, where particles 
possess individuality, the following possibilities must be considered: 


* Emax has an order of magnitude of several electron-volts, while the av- 
erage energy of thermal motion (ÆT) is equal to several hundredths of an elec- 
tron-volt. Thus, electrons move rapidly even at absolute zero. The velocity 
of electrons at absolute zero is 1,000 times as great as the velocity of atoms at 
room temperature. This should be re-emphasised in order to make it clear that 
the relationship existing between kinetic energy and temperature in the case 
of molecules is not applicable in the case of electrons. It follows, moreover, 
that an electron gas has negligible thermal capacity. The thermal capacity of 
a body is not affected by the presence of an electron gas. 


= 
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1) both particles in the first state; 2) both particles in the second 
state; 3) the first particle in the first state and the second in the sec- 
ond state; 4) the first particle in the second state and the second 
in the first state. Thus, there are four possibilities in all. 

In Bose-Einstein statistics, particles are indislinguishable from one 
another. There are therefore three possibilities: 1) both particles in 
the first state; 2) both particles in the second state; 3) one particle 
in the first state and one in the other. 

In Fermi-Dirac statistics, the Pauli exclusion principle is taken 
into account: only one particle may be in a given state. The number 
of possible distributions is reduced to one, i.e., one particle in each 
of the two states. © 

Thus, outer electrons of the atoms of a solid behave like an elec- 
tron gas. This is a very peculiar kind of gas, i.e., ils particles obey 
Fermi-Dirac statistics. 


270. Conductivity 


In the absence of an electric field, the state of an electron gas is 
such that the number of electrons moving from right to left is equal 
to the number moving from left to right. When a field is applied, 
forces are produced which make the electrons move parallel to the 
field. The distribution of electrons in momentum space becomes 
nonsymmetrical with respect to the origin, i.e., a displacement 
occurs in the direction of the field. An ordered motion which pro- 
duces an electric current is superimposed on the extremely rapid 
random motion of the electrons. 

For the distribution of electrons to be displaced, electrons must, 
to be sure, pass to higher energy states. Such a transition is always 
possible if there are vacancies in the energy band. If the energy 
band is fully occupied, i.e., if all its levels are occupied by elec- 
trons as allowed by the Pauli exclusion principle, the electrons have 
no place to go—in any case, not until they acquire sufficient energy 
to make a transition to the next band. 

Were it not for the overlapping of bands (discussed above), one 
could assume that all elements having one valence electron are 
conductors and all elements giving up two electrons to be shared 
in the process of forming a solid are insulators. Thus, sodium has 
one electron at the 3s level. In the formation of a body consisting 
of N sodium atoms this level splits into V levels. At each level there 
may be two electrons of opposite spin, i.e., a total of 2N electrons. 
But since we have only N valence electrons, half of the energy band 
is unoccupied. Magnesium, the next element in the Mendeleyev 
periodic table, has two electrons (per atom) at the 3s level. There- 
fore, in the formation of a magnesium crystal, all levels would be 
occupied were it not for the overlapping of energy bands. 


oy E 
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Investigation of the form of energy bands of various elements 
shows that the above explanation of the origin of conducting prop- 
erties is completely valid. Only when the upper band or the merged 
bands are not fully occupied may the body be classified as a conductor. 

The distribution of the electrons of a conducting body in momen- 
tum space may be displaced in the direction of a field. Since the 
number of electrons moving in the direction of a field does not equal 
the number of electrons moving in the opposite direction, an elec- 
trie current is produced. In an insulator, all the energy bands are 
completely occupied. Ordinary field intensilies are incapable of 
creating the forces necessary to transfer electrons to the next higher 
band, and if we persist in trying to transfer electrons of an insula- 
lor to the next band the dielectric breaks down. The electron dis- 
tribution maintains ilssymmetry in momentum space and the num- 
ber of electrons moving to the left remains equal to the number of 
electrons moving to the right, i.e., no current flows. 

To return to the discussion of conductors, let us now roughly esti- 
mate the magnitude of the electrical conductivity of a body which 
has n free electrons in a unit volume. By free electrons or conduction 
electrons, we mean electrons located in unfilled energy bands. 

Assume that the motion of an electron under the action of an accel- 
erating force eZ occurs during a small time interval t = i Here, 


v is the velocity of an electron and / the length of its mean free path. 
The path is traversed at the extremely high random velocity of 
the electron. The velocity of the ordered motion of the electrons 
creating the electric current is many orders of magnitude less than 
the random velocity and, therefore, is not included in the denomi- 


. : . . ; eE x 
nator of the expression for t. Motion with an acceleration = during 


x r 7 s eB L om 
a lime interval increases the electron velocity to —--,. Thus, the 
approximate value of the velocity of the ordered motion of the 
š 5 eEl 
electrons creating the current is u ~~. 


The density of the electric current is simply the quantity of elec- 
tricity passing through a unit area per unit time, i.e., j = neu. 
Substituting the above value of u, we obtain 


z ne?l 
j= Es 
mv 


Since Ohm’s law in differential form is given by j = oF, a rela- 
tion may be obtained for electrical conductivity: 


ne?l 
mu 
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This gives us only a rough estimation of the electrical conductiv- 
ity. In view of the assumptions made for purposes of simplification, 
calculated and experimental results will differ by as much as an 
order of magnitude. However, we are interested merely in a qualita- 
tive picture. It can be seen that the conductivity is proportional to 
the number of free electrons. This number and the length of the free 
path may vary from substance to substance. 


Example. If the length of the free path of an electron in a metal is 1 ~ 
~ 10 A = 10-7 em and v is of the order of magnitude of 108 cm/sec (see the 
example on p. 682), the free path is traversed in a time t = 1071 sec. 

Assume that the voltage drop along a 1-em segment of a metal conductor 
with a 1-cm® cross-section is equal to 0.003 v = 10-® Gaussian unit. Then, 
E = 1075 Gaussian unit and the velocity of the ordered motion of an electron 


is u popeels ~ 5 X 1073 cm/sec. The current density is j= neu ~ 1023 x 
m 


ny 
X 4.8 x 10710 X 5 X 1073 ~ 30 x 1010 Gaussian unit= 100 a/cm?. This 
yields quite reasonable values of conductivity: 


o ~ 25 x 1015 Gaussian unit = 28 X 104 ohm-1 cm-1. 


If a crystal had an ideal lattice and the temperature approached 
absolute zero, there would be no restriction on the length of the free 
path and the material would have no electrical resistance. The elec- 
tron range is limited by atomic thermal vibrations and the presence 
of various crystal imperfections. Both factors disturb the ideal peri- 
odicity of the field in which an electron moves and result in the 
scattering of electrons. It follows that the conductivity of a body 
improves as its temperature decreases and approaches a limit which 
is determined by the degree of perfection of the crystal lattice. 

It can be shown experimentally that the resistance of a metal de- 
creases with temperature. This would indicate that the theory is val- 
id for metals. Moreover, the fact that electrical resistance decreases 
with temperature is an essential characteristic of metals. The plas- 
tic deformation of a metal, the impairment of its lattice by nuclear 
bombardment, and in general any action serving to damage the lat- 
tice will reduce the length of the free path and therefore result in 
increasing the electrical resistance. 

In Part I (p. 222), the thermal conductivity of gases was discussed. 
It was shown that the thermal conductivity of a gas is propor- 
tional to the length ofthe free path and is given by the formula 
x ~ pvle,. Is this formula useful for the calculation of the thermal 
conductivity of metals? Electrons are much lighter than atoms and 
one is justified in assuming that heat is transmitted by electrons 
which transfer energy from one atom to another. Since the length of 
the free path is not known, one cannot calculate the coefficient of 
thermal conductivity. However, it should be noted that the ratio 
of the coefficient of electrical conductivity to the coefficient of ther- 
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mal conductivity does not contain unknown parameters and depends 
only on universal constants and temperature: 


— = const T 
o 


(Wiedemann-Franz formula). Experimental results agree fairly 
closely with the value obtained by means of this formula. The follow- 


; : see e 
ing table gives the values of the quantity oF at O°C for a number 
of metals. 


Metal Ag | Au | Cu Mo Pb Pt Sn Zn 
4 ae, 
a x ios SO 2.31 | 2.35 | 2.23} 2.61 | 2.47 | 2.51 | 2.52 | 2.31 


The theoretical value of this quantity is equal to 2.45 x 10-8. 
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Since a crystal always has a considerable number of imperfections, 
it will usually possess a residual resistance, which is reached at 
a temperature of several degrees Kelvin, i.e., below this point the 
resistance does not decrease with temperature. However, there exist 
about ten metals which behave quite differently. At definite tempera- 
tures close to absolute zero, such metals completely lose their 
electrical resistance. When an electric current is induced in such 
a superconductor, the current will flow in the cireuit for days. This 
shows that the resistance has not simply decreased, but has dropped 
abruptly to zero. 

Of the pure metals, niobium has the highest temperature (9°K) 
and hafnium the lowest (0.3°K) at which superconductory proper- 
ties appear. 

It might seem that superconductivity is a property common to 
all metals, i.e., if the temperature is reduced sufficiently super- 
conducting properties will appear. This is apparently not so. The 
temperature of numerous materials has been reduced down to 0.03°K 
without superconducting properties being manifested. The stppo- 
sition that such properties are not universal is supported by the fact 
that superconducting metals occupy a definite part (the middle) 
of the Mendeleyev periodic table. 

Superconducting materials include, in addition to pure metals, 
numerous alloys of such nonsuperconducting metals. Moreover, 


7 


688 Effect of Electron Structure on Properties oj Bodies 


a chemical compound may be a superconductor even though neither 
of its components is one. Thus, copper sulphide is a superconductor, 
but copper and sulphur are not. Niobium nitride already reveals 
superconducting properties at 30° above absolute zero. 

The disappearance of electrical resistance at a temperature Fh is 
not the only peculiarity of superconductors. 

Another mark of a superconductor is its characteristic behaviour 
in a magnetic field: generally speaking, a magnetic field penetrates 
such a conductor to a depth of only about 1 ,OOOA. If very thin films, 
the behaviour of which is somewhat peculiar, are left out of consid- 
eration, we may make the following generalisation: the magnetic 
field inside a superconductor equals zero. 

However, this is true only as long as the applied external field 
does not exceed a certain critical value H}. When this value is ex- 
ceeded, the superconducting state disappears—the magnetic field 
penetrates the material and electrical resistance is restored. 

H, is a function of temperature, i.e., it is not a constant. At the 
temperature Tp, a very weak external field suffices to destroy the 
superconducting state. Generally speaking at T = T, the critical 
intensity Ha equals zero. H, gradually increases as the temperature 
is decreased, and at absolute zero it reaches its highest value. For 
example, the maximum value of the critical field intensity of mer- 
cury (Ta = 4.2°K) is 412 oersted. 

Electrical resistance is due to the scattering of electrons by the 
thermal waves of atoms in a crystal lattice. These thermal waves 
exist, as we know, because of the presence of a zero energy, even al 
absolute zero. It would seem, therefore, that electrical resistance 
should not disappear no matter how much the temperature is 
decreased. 

- How is it possible then to have thermal scattering of electrons 
and no resistance to electric current? This problem was not solved 
until 1957 when it was proved by means of quantum mechanics that 
electrons in a thin energy layer next to a Fermi surface are able to 
become “paired” thanks to interaction with the thermal vibrations 
of a crystal lattice. 

It transpired that at low temperatures it is advantageous from 
the energy viewpoint for two electrons of equal spin magni- 
tude but opposite spin direction to become “united”. The words 
“paired?” and “united” have been placed in quotation marks 
because calculations indicate that the wave functions of these elec- 
trons extend over a large distance, viz., of the order of 10-* em (the 
size of a crystal grain in an ordinary polycrystalline metal), There- 
fore, the formed pairs should not be viewed as peculiar “molecules”; 
since the bond is implemented over a large distance by means of 


thermal waves. 
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If follows from the theory that all electron pairs are identical in 
the sense that they have the same total momentum. A 

“Matter” consisting of such electron pairs possesses superconduction 
properties. The formation of electron pairs does not eliminate the 
thermal scattering of electrons. Superconduction occurs because the 
scattering of the electrons of a pair ceases to affect the current 
strength. Thermal scattering can only break up one or another pair or 
form a new pair from separate electrons, but the magnitude of the 
current is determined by the total momentum of the electrons, 
which remains unchanged. Thus, according to this model, the ther- 
mal scattering of electrons may produce electric current fluctuations, 
but it cannot stop the current. 

A superconductor contains, in addition to “paired” electrons, an 
ordinary electron gas, i.e., a gas of individual electrons. Thus, in 
a superconductor, there exist, so to speak, two fluids—an ordinary 
fluid and a superconducting one. If the temperature of a superconduc- 
tor begins to rise from absolute zero, thermal motion will break 
up more and more pairs of electrons, i.e., the ratio of the ordinary 
electron gas to the superconducting electron gas will increase. Final- 
ly, the critical ‘temperature is reached and the last electron pairs 
break up. 

The new theory provides a quantitative explanation of all of the 
superconduction phenomena discussed above. 


s 
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Properties. A large group of substances (various elements and 
chemical compounds), the conductivity of which lies in the broad 
interval between those of conductors and insulators, are classified 
under the heading of semiconductors. Since a voltage of 4 volt will 
produce currents of several hundred thousand amperes in a cube of 
metal having a volume of 4 cm’, and currents of the order of 101° 
amp in insulators under the same conditions, one can see that the inter- 
val occupied by semiconductors is extremely large. 

The conductivity of substances in this interval has a number of 
peculiarities which enable us to “recognise” semiconductors. 

First, it should be noted that the dependence of conductivity on 
temperature is opposite to that of metals. The conductivity of semi- 
conductors, in contradistinction to that of metals, may decrease 
rapidly with temperature. At low temperatures, a semiconductor 
may become an insulator. The resistance of most semiconductors 
is considerably more sensitive to changes in temperature than met- 
als. Compact temperature meters of high sensitivity may be con- 
structed using semiconducting thermal resistors (thermistors). 


44—1409 


p 
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A second important feature of semiconductors is that in a number 
of cases they may possess positive (p) as well as negative (x) conduc- 
tivity. The terms positive and negative are used in the following 
sense: if the current is due to the motion of positive charges the con- 
ductivity is called positive and if it is due to the motion of negative 
charges it is called negative. Thus, metals have negative conductiv- 
ity since the current is due to the motion of electrons. Both kinds 
of conductivity occur in semiconductors. This effect seemed strange 
at first since the flow of 
current in a semiconduc- 
tor is not associated (as 
in an electrolyte) with the 
displacement of ions, and 
the question of the na- 
ture of positive carriers of 
current remained open for 
a period of time. 

The sign of the current 
carrier may be deter- 
mined in a number of 
ways. Let us examine the 
most convincing evidence, 
which is based on a study 
of the forces exerted by 
a magnetic field on current-carrying particles (Hall effect). If an 
electric current flows along a plate which is placed perpendicular to 
magnetic lines of force then, on a charged particle e moving with a 
velocity u, a force ¥ will be exerted in a direction perpendicular 
to the field and current (see Fig. 300). In other words, an electric 
field of intensity Æ = uB will be created in such a direction (see p. 297). 

A potential difference U = uBd is produced between the plate 
faces perpendicular to the created field. The sign of this potential 
difference is determined by the sign of the charge carrier. 


Fig. 300 


Example. Consider a 1 X 2 X 0.5 cm? semiconducting plate in a magnetic 
field B = 1,000 gausses. Assume that the conductivity o of the plate is equal 
to 3 ohm™ cm™ (zinc oxide). If a potential difference of 1 volt is applied be- 
tween the plate faces which are separated by a distance of 2.cm, an electric 


current of density j = oE = 3 Xx = = 1.5 a/cm? will flow through the plate. 


Experiments show that a potential difference U = 0.12 mv is produced 
between the lateral surfaces of the plate. The sign of the Hall effect (see 
Fig. 300) indicates that the charge carriers are electrons. The velocity of their 
ordered motion is 


“U 0.12 x 10-3 v 


Bd 0.4 v ser 
m 


u 0.12 m/sec =12 cm/sec. 


X 10-2m 


w 
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Note that this velocity is more than 1,000 times as great as the velocity of the 
ordered motion of conduction electrons in a metal (see the example on p. 686). 
The number of conduction electrons per unit volume of semiconductor is 


a 1.5 a/em2 
“eu 1.6% 10-19C x 12 cm/sec 


=8x 1017 cm8. 


n 


The low value of o is due to the fact that this x is about a millionth of that 
of a metal. 


A final important feature of semiconductors is their extreme sen- 
sitivity to impurities, which not only affect their conductivity (an 
impurity of the order of one per cent may change the conductivity 
of a semiconductor by a factor of a million or more), but may 
change n-conductivity into p-conductivity and vice versa. 

The most important semiconductors already of practical signifi- 
cance include germanium, silicon, selenium, antimony alloys (con- 
taining indium, cadmium and zinc), and copper and titanium oxides. 

Interpretation of Properties. Most characteristics of semiconduc- 
tors can be easily explained by means of an energy level diagram. 
Insulators have a filled energy band. The next unfilled band is sep- 
arated from the filled band by a large energy gap. Imagine that the 
system of levels of a substance is such that the gap between these 
bands decreases and the energy of thermal motion suffices to transfer 
electrons from the filled -band to the unfilled band. Such a substance 
will act as a natural semiconductor. 

At a given temperature, the number of electrons in the upper band 
will be determined by the dynamic equilibrium conditions estab- 
lished between bands. Electrons continuously pass from the lower 
band to an excited state and vice versa. Just as in the case of a sat- 
urated vapour, equilibrium will prevail when the number of elec- 
trons moving “upwards” equals the number of electrons moving 
“downwards”. 

Again as in the case of a saturated vapour, when the temperature 
is raised the equilibrium is displaced in the direction of the upper 
level, i.e., the instantaneous concentration of electrons in the upper 
band increases. The concentration of free electrons rises sharply 
with the increase in gap between bands. The probability of sur- 
mounting an energy barrier of width Ag invariably appears as an 
exponential factor. The approximate concentration of electrons in 
the upper band at a temperature 7 may be determined from the 


A 

formula n ~ 10% X e 2AT, 
If a body has a gap A€ which is significantly greater than kT, it 
belongs under the heading of insulators. For this purpose, it is suf- 
ficient for € to be 100-200 times as great as kT. At room tempera- 


44% 
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ture, kT ~ oy ev. When A@ becomes less than 1 ev, i.e., about 
40 times as great as ÆT, the number of electrons in the upper band 
will be sufficient to create measurable currents. When Ag is of 
the order of several tenths of an electron-volt, a semiconductor pos- 
sesses considerable conductivity. 


Reference to the conductivity formula o = 2} shows that when 
y mv 


the temperature of a semiconductor changes two factors on which 
o depends also change. In the first place, as the temperature increa- 
ses, the number of free electrons 
< Field n increases, but, as previously, 
—> Movement of electrons the free range l decreases With 
increasing temperature. However, 
experiments show that usually 
the former effect overshadows the 
latter. 

Until now, we have discussed 
the conduction properties of the 
b) upper band, but have disregarded 
the lower band, which should 
also acquire conduction proper- 
ties since vacancies are formed in 
it as the result of the transition of electrons to the upper band. This 

conductivity may be very peculiar in nature. 

The creation of conductivity in the upper, partially filled band 
may be interpreted as a displacement in the distribution of electrons 
in momentum space in the direction of the field (to the right in 
Fig. 301a). However, this is not the only way in which ordered motion 
of electrons may occur. Imagine that the overall shape of the 
distribution of electrons does not change (see Fig. 301b). However, 
now one electron, and now another, close to the Fermi surface is 
snatched away and a “hole” is formed in momentum space. Under 
the action of the field, such a hole is immediately filled by a neigh- 
bouring electron moving from left to right (in the same direction asin 
the other diagram). The hole is displaced from right to left. Now, 
another point representing an electron in momentum space occupies 
this position and in this manner-the hole moves in the opposite 
direction to that of the electrons. Since holes are formed continually, 
a continuous positive “hole” current flows. 

Thus, in a natural semiconductor, electric current may be viewed 
as the result of the motion of “holes” in the occupied band as well 
as of electrons in the upper band. However, the major role in such 
cases is played by the motion of electrons in the conduction 
band. 


Fig. 301 
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This natural conduction of semiconductors is encountered consid- 
erably less frequently than another phenomenon, namely, semi- 
conduction properties under the action of a small- percentage of 
impurities. The role of foreign atoms or other lattice imperfections 
consists in their contribution to the system of energy levels. Fre- 
quently, imperfections create their own level—a narrow energy band 
between the filled and unfilled bands. 

Let us assume that the foreign atoms contribute “surplus” elec- 
trons which occupy a narrow band between the main bands. When the 
temperature increases, 
electrons pass from the 
level of the impuri- 
ties to the conduction 
band in greater and great- 
er numbers, i.e., conduc- 
tion increases. Such a 
semiconductor yields n- 
conduction. A point may 
be reached (for a low per- 
centage of impurities) at 
which all surplus elec- 
trons are given up. A fur- 
ther increase in tempera- 
ture will not result in an 
increase in conduction 
and from then on the 
body will act like a met- 
al. Such behaviour may 
be detected when penta- Fig. 302 
valent arsenic or . phos- 
phorus atoms are introduced into a lattice of quadrivalent silicon 
or germanium. Fig. 302 shows a simplified diagram of the crystal 
lattice of silicon. If a silicon atom is replaced by an arsenic atom, a 
“surplus” electron is obtained. This will be a conduction electron. 

It is interesting that impurities may result in p-conduction. This 
occurs when an impurity atom has acceptor properties, i.e., can 
attract electrons. Electrons pass from the filled band to the inter- 
mediate level of the impurities and as a result hole conduction occurs 
in the filled band. Such conduction occurs in silicon containing 
a trivalent aluminium impurity. If at a number of lattice sites 
silicon is replaced by aluminium, “electron-deficient sites” will 
exist in the crystal. When a field is applied, an aluminium atom may 
attract an electron from a neighbouring silicon atom; the electron 
falls under the action of the electric field and a “hole” moves in the 


opposite direction. 
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It should be noted that such “unsophisticated” models of the dis- 
placement of an electron are greatly oversimplified in veiw of the 
fact that the motion of electrons in a solid satisfies the laws of quan- 
tum mechanics. 

By adding one or another impurity, we are able to vary, the con- 
ductivity of materials within very broad limits. We may change 
P-type conductivity into n-type conductivity and may significantly 
change the nature of the temperature dependence of the conductivity. 


273. Emission of Electrons 


Work Function of an Electron. Electrons in a conduction band 
behave like an electron gas. The surface of a solid acts as the “walls” 
of the vessel in which this gas is 
zocated. To leave the bounds of 
this surface, an electron must 
surmount a potential barrier the 
height of which is designated by 
6. At absolute zero, electrons 
have a limiting energy W. In the 
model of an electron gas, W cor- 
responds to a Fermi surface. This 
is the energy of electrons which 
at absolute zero are located at 
the highest level. Thus, in order for an electron to surmount the 
potential barrier, it is not necessary to impart to it an energy @; 
it is sufficient to give it an additional energy 

A=6@—W. 


The quantity A is called the work function and 4 = @ the potential 
q e 


function, i.e., the work function expressed in volts (see Fig. 303). 

An electron’s escape from a metal is impeded by the forces of 
attraction exerted by positive ions as well as the force of attraction 
between the electron and its electrical image. The latter force is 


equal to A , Where z is the distance of the electron from the surface. 


This force is capable of holding an electron at a considerable distance 
from the surface and thus forming a layer or electron cloud close to 
the surface of the body. 

If the metal is located in an electric field, the work function de- 
creases by an amount eV eE, where e is the electron charge and Æ is 
the field intensity. The intensity of the external field must equal 
0.2 xX 10? v/cm if the potential function is to decrease by 1v. (Thus, 
in ordinary electronic instruments the external field has little effect 
on the work function.) 


Fig. 303 
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The work function is very sensitive to changes in surface properties. 
It transpired that it is possible to deposit electrically positive atoms 
or ions (metals, oxygen) on the surface of a cathode. In this manner, 
a layer of positive charge may form on such a surface. Thus, as may 
be seen from the following example, the work function can be con- 
siderably reduced. i 

The filament of the heater in electron tubes is usually made of 
tungsten (work function A = 4.9 ev). By coating the surface: of the 
tungsten with a layer of an oxide 
of an alkaline earth metal (Ca, 
Ba, Sr), we may decrease the work 
function to 1.5-2 ev. This enables 
us to obtain considerably greater 
emission at lower filament tem- 
peratures. 

Thermionic Emission. The escape 
of electrons from a metal upon 
heating is known as thermionic emis- 
sion. This phenomenon is basic to 
the action of heater-cathode tubes. 
When the temperature is raised, 
electrons are excited, some of them 
acquring a sufficient velocity in the 
direction perpendicular to the sur- 
face of the material to surmount the potential barrier ĝ. 

An electron gas obeys Fermi-Dirac statistics, according to which 
the number of electrons having an energy ĝ is proportional to the 


z 4 ; : š 
expression -pw - But we are interested in the energies 6 which 
e kT +1 
are considerably greater than the zero-level energy W. Therefore, it 
is accurate enough to reduce the above expression to 
WE Cree ee 
SRT e AT 


Fig. 304 


Thus, we may determine the number of electrons having an energy 
equal to the height of the potential well. It may be rigorously proved » 
that the thermionic emission current is proportional to this expres- 
sion. We see from the formula that thermionic current increases ex- 
tremely rapidly with temperature. 

The circuit shown in Fig. 304 may be used to measure the thermionic 
current. By increasing the voltage, we quickly reach saturation 
current. (This is the thermionic current referred to above.) The ini- 
tial portion of the current-voltage curve is space-charge limited 
(see above). The voltage draws electrons to the plate from the elec- 


ee 
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tron cloud and the cathode emits enough electrons to maintain the 
cloud in equilibrium. In the absence of an external voltage, this 
equilibrium is determined by the electron emission of l he cathode 
and the negative potential of the cloud. As the voltage is increased, 
the electron cloud begins to dissipate and emission increases until 
the voltage draws all electrons of the cloud to the plate. At this 
point current saturation sets in. 


Rigorous analysis yields the following relationship between electronic 
current and temperature: 


ep 
İ=4AT2e *T (Richardson formula). 


For tungsten, A = 75 a/cem2 deg? and ep = 4.5 ev. Let us compare the 
current density of thermionic emission form tungsten at 500°K and 2,000°K. 
At 500°K, 
4.5x1.6x10712 
ae SE SIO Sd 


[= 75 x 25 x 104 x@ 1-38X101X500 _ 10-36 aroma, 


In other words, to obtain measurable currents, cathodes of impracticable 


dimensions (greater than that of the entire land mass of the globe) would be 
required. 


At 2,000°K, 
_5.21x104 
T=75X4X108xe 200 dg g ma/cm2, 
Such a current is easily measured, but the are 
too large for most practical purposes. 
The picture changes when tungsten is coated with caesium. Now, A = 
= 3.2 a/cm?deg? but ep = 1.36 ev. At T= 2,000°K , 
_1.57x104 
T=3.2X4X108 xe 2000 4 gy age a/em?. 
Clearly, such current densities would destroy the cath 


required values of current density, 7 <1 acm? 
peratures (~1,300°K). 


a of the emitting surface is stil] 


ode. Therefore, the 
» are attained at lower tem- 


Secondary Emission. This refers to emission due to the dislodge- 
ment of electrons from a metal under the action of other electrons. 
Secondary electrons may emerge, taking the same direction as the 
primary electrons. This shows that primary electrons interact with 
bound electrons; otherwise, the law of conservation of momentum 
would be violated. Secondary emission begins at a primary electron 
energy of the order of 10 ev. Most secondary electrons have an energy 
of several electron-volts—their energy distribution js practically 
independent of the energy of the primary electrons. . 

Primary electrons produce secondary electrons and, in addition, 
are elastically scattered. The remarkable phenomenon of one pri- 
mary electron producing several secondary electrons has wide appli- 
cation (e.g., Kubetsky tube). 
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Extrinsic PhotoelYeet. This phenomenon may be studied by mak- 
ing the cathode of a vacuum tube out of the material under inves- 
tigation. Light falling on the cathode dislodges electrons from the 
material. Electrons reaching the anode generate a photoelectric 
current, the magnilude of which may be studied as a function of 
external conditions. 

It is the total current taken from the cathode, of course, that is 
characteristic of the substance. Here too, therefore, we usually oper- 
ate under conditions of saturation current. If no voltage is applied 
across the photocell, a weak current generated by electrons leaving 
the cathode in the direction of the anode flows through the instru- 
ment. A weak accelerating voltage is not sufficient to attract all the 
electrons, but at a certain voltage all the electrons reach the anode 
and saturation current is obtained. z 

Experiments show that the photocurrent is strictly proportional 
to the intensity of the incident light. This is true of light of any 
frequency which produces a photoeffect. 

Moreover, the number of dislodged electrons is exactly equal to 
the number of photons. One photon may dislodge only one electron. 
It is not possible for a photon to dislodge several electrons from 
a substance by a series of energy losses. This important postulate is 
somewhat difficult to prove in a study of the extrinsic photoeffect, 
since the extrinsic photoeffect may be accompanied by an intrinsic 
photoeffect (see below) in which some of the electrons do not leave 
the bounds of the substance. 

The law of conservation of energy and the law of conservation of 
momentum are obeyed in the interaction of a photon and an elec- 
tron. The law of conservation of energy (the Einstein equation) takes 


the form 
mv? 


hý= Spe eR 


where ¢p is the potential function of the electron for the metal (the 
same as in the thermionic emission experiments). In accordance with 
the law of conservation of momentum, it may be assumed that the 
lattice of the metal takes part in the photoelectron interaction proc- 
ess (otherwise electrons could move only in the same direction as 
the photons). 

A photon may produce a photoeffect if its energy is not less than 
the work function. It follows that each material has a photoeffect 


limit. The limiting frequency vo is equal to to and the limiting 
wavelength Ao (in millimicrons)—the “red” limit of the photoeffect — 


a aa 
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poms 298 , where @ is expressed in volts. No photoeffect will 

ep 


take place when a substance is irradiated with light of long wave- 

length. The “red” limit of the photoeffect in the case of Li is Aj = 

= 560 millimicrons (5,600 A), i.e., it is in the yellow region of the 

visible spectrum; in the case of Cu, 4) = 300 millimicrons (3,000 A), 

i.e., the ultraviolet region; and in 

V% the case of Al, Ao = 410 millimic- 

rons (4,100 A), i.e., the violet re- 

gion of the visible spectrum. 

h If the energy of a photon is great- 

arctan © er than the work function, the 

surplus goes into the kinetic energy 

of the electron. Thus, hard radia- 

tion can produce very fast photoe- 
g lectrons. 

š ™liIn order to measure exactly the 

Fig. 305 limiting frequency and the work 

function, we use a retarding poten- 

tial method. A small bias voltage is applied to the photocell (plus 

terminal is connected to the photocathode) and this voltage is in- 

creased until the current cuts off. This point is reached when 

V= . In this manner, we may determine experimentally the de- 
pendence of V, on the frequency of light: 


hy 
Vo aaa p. 


The plotted curve is a straight line the slope of which is equal to 
the universal constant + . The limiting frequency vo and the poten- 
tial function @ are obtained as intercepts along the abscissa and ordi- 
nate, respectively (see Fig. 305). 

Example. If soft X-rays of wavelength 4 = 100 A fall on a copper plate 
(p = 4.1 v), the bias voltage cuts off the photocurrent when 


x 1010 K 
eVo =hy—ep=6.6 x 10-7 3X1 x] rata 4.1 ev=120 ev. 


Therefore, the bias voltage will’ equal 120 v. 


Another important characteristic of a photocathode material is 
the spectral dependence of the photocathode. Here, no simple rela- 
tionship may be applied. The curve begins at the limiting frequency 
and in many cases increases rather uniformly; one can say that the 
coefficient of utilisation of photons increases with photon energy- 
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However, in other cases, the spectral curves have well defined maxima 
which lie within a rather narrow spectral band. 

Photocells which utilise the extrinsic photoeffect have broad appli- 
cation. They are used in photo relays, television and cinema sound 
tracks. Silver, caesium and potassium may serve as photocathodes; 
antimony-caesium cathodes are widely employed. : 

In various photo relay applications, the photocurrent need not be 
proportional to the intensity of light, but the sensitivity of the 
photocell should be high. In such cases, gas photocells instead of 
vacuum photocells may be used. This increases the sensitivity tens 
of times. 

Intrinsic Photoeffect. When the action of a photon results in the 
displacement of an electron from a filled band to a level of an impu- 
rity or to a conduction level, it is referred to as the intrinsic photo- 
effect. Under the action of light, this phenomenon may produce con- 
duction electrons and holes in a body. Such conduction electrons and 
holes will occur in pairs. Strictly speaking there will be a pair of 
charges for each photon. The phenomenon is extremely complicated 
by secondary processes which occur within a body as a consequence 
of the recombination of electrons and holes. 

It is clear, therefore, that the intrinsic photoeffect is a phenomenon 
which is particularly. characteristic of semiconductors, but which 
may also occur in insulators. 

Semiconductors which possess this effect are included in current 
circuits as photo resistors. In the dark, such a body has very low 
(dark) conduction. Its conduction increases when subjected to light. 
Energies of several tenths of an electron-volt may be sufficient to 
produce intrinsic electron transitions. Therefore, the threshold of 
the intrinsic photoeffect may lie in the far infrared region. 

Photo resistors are widely used in signal systems and in automation 
when it is necessary to amplify or detect very small changes in 


light intensity. 
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mber of cases, the junction between a metal and a semicon- 
ductor or between two semiconductors may have a rectifying action. 
Even though the boundary between two bodies in close contact (weld- 
ed or fused) is very narrow, nevertheless it is of finite width; hence 
the designation barrier layer. Such a layer may form at the junction 
between copper and cuprous oxide or at the junction between sele- 
nium and cadmium selenide. Investigations indicate that a barrier 
layer between two semiconductors is formed when one of the semi- 
conductors is a p-type conductor and the other an n-type. Such 


barrier layers are called p-n junctions. 


In anu 


“= 
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Fig. 306 illustrates the rectification provided by barrier layers. 
The figure shows a typical current-voltage curve. The right branch 
of the curve is the characteristic for the forward current and the 
Jeft branch for the reverse current. The forward current increases 
rapidly with increasing voltage, but the reverse current remains 
almost constant and has a very low value. 

Which is the direction of forward current? Investigations indi- 
cate that at a p-n junction the forward current flows from the p-type 

semiconductor, through the junction, 

aa) to the n-type semiconductor. This 
means that holes move toward elec- 
trons and electrons move toward 
the higher concentration of holes. 
Reverse current flows when the 
holes and electrons move away from 
the junction. 

In the case of a metal-semicon- 
ductor junction, the situation may 
be described as follows. If the metal 

-yv +V forms a junction with a p-type sem- 

=i iconductor (copper-cuprous oxide), 

; the forward current will flow from 

Fig. 306 the cuprous oxide to the copper. 

i This is to be expected: There are 

no free electrons in a semiconductor, but in a metal there are 

an excess of such electrons; therefore, electrons move from the metal 
to the semiconductor. . 

The characteristics of barrier layers find wide application in indus- 
trial rectifiers. Copper-oxide rectifiers (copper-cuprous oxide) and 
selenium rectifiers have been produced for a long time. During recent 
years, tiny germanium and _-silicon rectifiers—crystal diodes—have 
been widely introduced. The introduction of impurities into ger- 
manium or silicon may transform them into p-type or n-type con- 
ductors. 

A crystal diode consists of a very small germanium (or silicon) 
crystal, one part of which contains an acceptor-type impurity and 
the other a donor-type impurity. ` 

Also of interest are crystal triodes, representing a semiconductor 
system of the p-n-p or n-p-n type. If a wire is soldered to each of the 
three sections of such a tiny triode (the dimensions of crystal “tubes” 
are of the order of a centimetre), the system may be connected in 
a circuit just as an ordinary triode tube. A voltage is applied across 
the two outer ends, one end serving as the anode and the other as 
the cathode. The third tap serves as the grid. Such a system of semi- 
conductors has two barrier layers connected in opposition. This 
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is why it behaves like a triode tube. The analogous operation of 
-ystal triodes and ordinary triodes is illustrated in Fig. 307. 

Another important use of barrier layers is in the manufacture of 
photocells, which operate without a voltage source. Coating a semi- 
conductor with a thin ‘layer of metal will produce a barrier layer. 
A layer of metal may be made so thin that light easily passes through 
it. When light passes through the metal, an intrinsic photoeffect 
is produced in the semiconductor. The presence of a barrier layer 


Collector 
N 


Vacuum triode 


Semiconductor 
triode 
Fig. 307 


sauses the liberated electrons to move in a definite direction. An 
electric current will flow when the circuit is closed. 

Copper oxide and selenium photocells are manufactured on the 
basis of this principle. Sulphur-thallium photocells, having a short- 
circuit current of the order of 10,000 microamperes per lumen, are 
at present being used in the Soviet Union. They have an efficiency 
of transformation of light energy into electrical energy of the order 
of 4 per cent. Here too silicon and germanium p-n barrier layers 
are of fundamental significance. They enable us to manufacture 
photocells with an efficiency of the order of 10 per cent. This new 
discovery has placed the utilisation of solar energy on a practi- 


cal basis. 
276. Contact Potential 


When two metals or semiconductors are in contact, there arises 
a difference of potential between them. This difference is known as 
contact potential. To measure this difference, we must remove 
inclusions, oxide films, etc., and make close contact between the 
surfaces of two such bodies by soldering, welding or pregrinding. 
In this manner, we may form a circuit that is broken in one place. 
Since all points of the body are at the same potential, the contact 
potential may be determined by measuring the field in the gap. 
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The order of magnitude of the contact potential between two bod- 
ies is equal to several tenths of a volt. 

The existence of contact potential is easily explained. It is due 
to the difference in work functions of the two adjacent bodies. It 
should be recalled that the work function Æ is equal to the differ- 
ence between the energy W of an electron at the highest level inside 
a metal (at absolute zero) and the energy of an electron escaping 
from the metal with zero kinetic energy. 

The energy distribution of electrons in the case of two solids 
placed in contact can be represented as shown in Fig. 308. The 
upper level is the same for all 
bodies. The electrons of the body 
having a lower work function 
are at a higher energy level. 
Thus, the conditions are present 
-for the transition of electrons 
from the first body to the second. 
This transition is accompanied by 
the formation of positive charge on the first body and negative charge 
on the second. At the point of contact, there arises an electric 
field which impedes the transition of electrons. Finally, equilib- 
rium will be established at a particular potential difference char- 
acteristic of the given pair of metals. ' 

This picture depends little on temperature. As the temperature is 
raised, the energy distribution boundary of the electrons is no long- 
er so sharply defined. Electrons appear at higher levels, but the 
conditions for the transition of electrons remain basically the same 
thanks to a lack of close dependence of the energy of the electrons 
on temperature. 

It is evident from the above description of this phenomenon that 
any group of solids may be arranged in a definite sequence such that 
each member of the sequence becomes positively charged with 
respect to the next. Such a series was first obtained by Volta, the 
discoverer of contact potential. From the above explanation of 
this phenomenon, it is clear that a Volta series corresponds to rising 
work functions, i.e., the motion of electrons between two bodies 
is in the direction of the one having a higher work function. 

Since the contact potential p2 between two bodies may be ex- 
pressed in the form of a difference of work functions, 


Fig. 308 


1 
P12 = T (Ay — 4), 


it is evident that the potential difference between two bodies may 
be expressed as the difference of the contact potentials between 


U 
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each of these bodies and a third: 
1 1 
P => (41—42), P13 => (Ai — As); 


1 
P23 z (Ae As) = P21 — Ps1- 


Furthermore, it is evident that in a closed circuit consisting of 
any number of bodies the total contact potential is equal to zero: 
Piz + P23 + Pa = 0. No current will flow in such a circuit. 


Electromotive Series 
(normal potential in an electrolyte solution, volts) 


Li «Ga ‘Nagi, Al zg Ro Ni Ph Cu Hg Ag_ Pt 
—3.01 —2.84 —2.71 —1.66 —0.76 —0.44 —0.23 —0.12 -+0.34 -+0.70 +0.80 +14.2 


277. Charge Distribution in a Nonuniformly 
: Heated Body 


Let us consider a rod along which a drop in temperature occurs. 
Different portions of the rod will be subject to different conditions, 
and this will affect the behaviour of the free electric charges. Where 
the temperature is higher the charges will have greater energy; 
moreover, the number of free charges may increase if electrons can 
pass from the filled band to the conduction band. Both effects tend 
to produce diffusion of free charges, which continues untill a field 
which counterbalances the tendency to uniform distribution is creat- 
ed. A drop in potential will occur along the rod; negative charge is 
formed at one end of the rod and positive charge at the other. Each 
body has its own characteristic curve of potential drop as a function 
of temperature. The rate of fall of potential may be described by the 
derivative of the potential with respect to temperature: 


If a constant temperature difference is maintained between the 
ends of a rod, heat is transferred continuously through the rod. 
Heat is transferred by free charges, but current cannot flow in an 
open circuit. The continuous transfer of energy without the transfer 
of charge is achieved thanks to the different velocity of the charges 
moving from the hot end to the cold end, while the number of charges 
passing through a given cross-section per unit time is the same in 
each direction. If electrons are the carriers of current, an excess con- 
centration of them is produced at the cold end of the rod. If positive 
particles or holes are the carriers of current, positive charge accumu- 
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lates at the cold end. Thus, the sign of the potential difference will 
differ, depending on the sign of the current carriers. 

Will the above effect obtain in the case of a semiconductor, partic- 
ularly one in which holes as well as electrons are the carriers of 
current? As a matter of fact, such bilateral diffusion may reduce to 
zero the potential difference of a nonuniformly heated body. However, 
the potential differences formed by positive and negative current 
carriers may not balance each other. This may occur as a result of 
a difference in mobility between electrons and holes, and also as 
a result of differences in their concentrations. 

Certain difficulties occur in detecting potential differences in 
a nonuniformly heated conductor. The order of magnitude of such 
a potential drop is 10~* v/deg. This effect cannot be detected, of 
course, by forming a closed circuit of the conductor in the hope of 
measuring electric current. Such a closed circuit may be conceived 
of as divided into two halves: in one half a potential drop occurs 
and in the other a potential rise. In a uniform conductor, the mag- 
nitudes. of these two potentials will be exactly equal; hence the 
emf we wish oaas ire will not be detected. 


278. Thermoelectromotive Force 


Electric current will flow in a wire ring consisting of two (or more) 
different materials if the junctions have different temperatures. 
This is the well-known thermoelectric effect, which has found broad 
practical application. 

There are two possible reasons for the flow of a thermoelectric 
current. First, it is evident that the potential drops along the two 
wires due to temperature drop may differ if the values of the constant 


d y 2 
a = differ for the two materials (we shall designate them as 


I and II). 
Ta Ti: 
Thus, the potential differences \ ar dT and \ an dT generally 


Ty T2 

are not equal. This alone would be sufficient for an emf equal to 
the difference between these voltages to arise in the wire ring. 

The second reason for thermoelectric current lies in the fact that 
contact potential quite probably depends on temperature. If the 
two junctions are placed at different temperatures, their contact 
potentials may differ. Again, this condition alone would be suffi- 
cient for a net potential difference to exist in the closed circuit and, 
hence, for a current to flow. 

Taking both phenomena into account, we may express the ther- 
moelectromotive force as the sum of the voltage drop in the first 


/ 


$ 
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wire, the jump in potential from the first wire to the second, the 
potential drop in the second wire, and the jump in potential from 
the second wire to the starting point of the circuit: 


Ta Ty 
g= í ay dT + [Pu (T2) — Gr (L2)] + \ an dT + [91 (71) — Pu (£1). 
Tı Ta i 


To simplify the above expression, let us write the difference 
Pir (12) — Pir (Tı) in the form 


and the analogous second difference in a similar form. Now, the for- 
mula for the emf assumes the form 


Thus, we have succeeded in expressing & in the form of a difference 
between two quantities, each of which is characteristic of a given 
body. Quite often the term “thermoelectromotive force” is used to 
refer to the emf per degree: 


dp 


Cranes 


rather than to the above integral. This quantity is a fundamental 
characteristic of the thermoelectric properties of a body. It is not 
an invariable constant, for it may depend on thermodynamic condi- 
tions, including the temperature. However, for many bodies this 
dependence is not well defined. 

By measuring the thermoelectromotive force, we may determine 
the difference between the above quantities, but we cannot deter- 
mine a. However, by forming different pairs of conductors and semi- 
conductors, we are able to determine the value of œ relative to 
a material laken as a “base”. Thus, materials may be arranged in 
a series in accordance with their thermoelectromotive forces. For 
reasons which are quite understandable in’ view of what has been 
said, a thermoelectromotive force series does not coincide with the 
corresponding contact potential series. 


Let us list the thermoelectromotive forces of several metals with respect to 
platnum. If a given metal is joined to platinum and one junction is held at 
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0°C while the other is held at 100°C, an emf arises in the closed circuit: 


Antimony + 4.0 mv 
Iron + 41,9)” 
Copper +0.75 7 
Nickel —-1.5 ” 


Constantan — 3.4 ” 


The positive sign indicates that at the 0°-junction current flows from the 
given metal to the platinum. 


Using a table of values of the constant œ, one can calculate the 
thermoelectromotive force occurring for a given temperature differ- 
ence from the expression @ = (4; — a2) (Tı — Ta). This is the 
expression for a thermal element consisting of two metals or two 
semiconductors the current carriers of which have the same sign. 
In this case, the potential differences arising in the two branches of 
the circuit are in phase opposition, and the resulting emf is equal 
to the difference between the effects of the two conductors forming 
the circuit. However, the picture changes when the circuit is formed 
of two semiconductors, one of which possesses hole conductivity 
and the other electron conductivity. In a p-type conductor, holes 
moye toward the cold junction and electrons toward the hot junc- 
tion. In an n-type conductor, electrons move toward the cold junc- 
tion. The two effects reinforce each other and the formula assumes 
the form. 


Ê = (44 +22) (Ty—T2). 


This fact is of great practical significance. 


279. Liberation of Heat in Electrical Circuits 


Joule heat is liberated in a conductor in which current flows. 
The displacement of charges along a body is accompanied by two 
-other thermal effects. 

The first of these, the Peltier effect, consists in the following. If 
electric current passes through a junction between two bodies, heat 
proportional to the current strength is released or absorbed at the 


junction: 
O= 


where TI is a proportionality constant. A remarkable feature of this 
effect is that the sign and magnitude of the heat changes when the 
direction of the electric current changes, i.e., depending on this 
direction, a particular junction will release or absorb heat. This 
was demonstrated by Lenz more than a century ago. A drop of water 
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was placed in a recess at the junction between an antimony rod and 
a bismuth rod. Then, by passing current in one direction, he showed 
that the drop freezes. When the current was reversed, the drop 
melted. 

The second effect occurs in any uniform conductor which is heated 
nonuniformly. Assume, for example, current flows along a rod, one 
end of which is maintained at one temperature and the other end 
at another. In such a conductor additional heat, £ 
proportional to the first power of the current ld 
strength (not to the second power as in the case 
of the Joule effect), will be liberated. To detect 
this effect, we must, of course, reduce Joule heat 
to a minimum. 

This effect, which was predicted by the British 
physicist Thomson on the basis of thermodynamic f | 
considerations, may be demonstrated in the fol- 
lowing manner. Included in the current circuit H C 
are two bars which are placed parallel to each 
other as shown in Fig. 309- The ends of these 
bars, maintained in pairs, are held at different 
temperatures. It would seem, in view of the sym- 
metry of the arrangement, that symmetrical points 
of the bars should have the same temperature. 

However, in one bar the current flows from the 

hot end to the cold end and in the other from Hat 

the cold end to the hot. Owing to the Thomson 

effect, corresponding points of the two bars do 

not have the same temperature. A point of the Fig. 309 

bar in which the current flows from the hot to the 

cold end will be hotter than the corresponding point of the other bar. 
The quantity of heat liberated per second in a segment of length 


dx may be written in the form y 
or 
dsl rane dz, 


where t is a proportionality constant. The greater the temperature 
gradient, the greater the quantity of heat. Three effects exist simul- 
taneously in a thermoelectrical circuit: the appearance of a thermo- 
electromotive force, the Peltier effect and the Thomson effect. On 
the basis of the principles of thermodynamics, it can be proved 
that these three processes are interconnected. This requires no proof 
for weak currents. Since the thermoelectric effects are proportional 
to the first power of the current and Joule heat to the second, the 


Joule heat is negligible in such cases. 
45* 
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Example. The ends of a rod of sodium (t = —8.5 x {076 y ‘deg and p = 
= 5 X 10-6 ohm cm) of 10-cm length and 5-mm? cross-section are maintained 
at temperatures of 300°K and 310°K. 

When a current J = 0.5 ma flows from the hot to the cold end of the rod, 
the heat liberated in the conductor per unit time due to the Thomson effect is 


Qr=tI = l= —8.5 x 1076 (—5 x 10-4) x 1 x 10 < 108 J/sec. 


The minus sign preceding the current indicates that the current flows in the 
direction of decreasing temperature. The teat liberated in the conductor per 
unit time due to the Joule effect is 

ka 
5x102 


QJ=I?R = (5 x 1074)2 x 5 x 10-6 2.5% 10710 J/sec, 


i.e., about m of the value of the Thomson heat. 


Thermodynamic analysis shows that the coefficients «, II and 
T are interrelated as follows: t= a and anm Substitut- 


ing aT for TI in the first relation, we obtain: t= re. The abso- 
lute value of œ may be determined from these equations. 


The Peltier and Thomson effects have the same physical basis as 
thermoelectromotive force. In the final analysis, a thermoelectromo- 
tive force arises from the fact that heat flow transfers electric charges. 
Here, however, we are dealing with phenomena in which a flow of 
electric charges transfers heat. 


280. Applications of the Thermoelectric Efect 


The opportunities for the application of thermocouples as genera- 
tors of electrical energy have increased considerably in recent times. 
Metal thermocouples have a coefficient of efficiency of the order of 
0.5 per cent, but that of a semiconductor thermocouple consisting 
of a hole segment and an electron segment has already reached as 
much as 7-8 per cent. The low efficiency results from irreversible 
losses in the form of Joule heat. If Ro is the resistance of the inter- 
nal portion of the circuit and R that of the external circuit, the 
power delivered to the external resistance (useful power) will equal 

ER 
(R + Ro)? i 
the value of the thermoelectromotive force, we obtain for the power 
of a thermocouple the expression 

2 2 R 
Eel Eg 
The electromotive force of a thermocouple is of the order of several 
tenths of a volt. To obtain a voltage of 120 v, for example, thermo- 


for any electrical circuit; here, 6 is the emf. Substituting 
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couples are connected in series like a battery. If heavy currents are 
required, thermocouples must be connected in parallel. 

Another important application of the thermoeffect, which has 
also become possible as the result of the development of semicon- 
ductor engineering, is the employment of semiconductors as a re- 
frigerator. 

The application of the thermoelectric effect to the measurement 
of temperature is well known and need not be discussed. 

An important and long known field of application of the thermo- 
effect is in the detection of very small amounts of heat. The opportu- 
nities in this field have increased still further as a result of the fact 
that semiconductors yield large thermoelectromotive forces. For 
such purposes, thermocouples connected in series—so-called ther- 
mopiles—are used. Every other junction of a thermopile is cooled 
and the alternate ones are heated. Thermopiles are used to measure 
power levels as low as several ergs per second. However, it is pos- 
sible to lower this limit still further, i.e., to several tenths of an 
erg per second. This is achieved by means of vacuum thermocouples, 
the thermal loses of which are reduced to a minimum. 
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